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Method of producing permuteins by scanning 

permutagenesis 



Priority 

The present application claims priority under Title 35, United States Code, 
§ 119 of United States Provisional Application Serial No. 60/101,908, filed 
September 25, 1998. 

Field of the invention 

A method of producing circularly-permuted proteins (permuteins) by 
scanning permutagenesis comprises making and inserting a series of circularly- 
permuted genes into a display vector, expressing these genes such that the gene 
products are localized to the surface of the display vector, generating a library of 
display vectors presenting the permuted protein, affinity-selecting the display 
vectors with a target protein that can bind the permuted protein, isolating and 
analyzing clones of selected display vectors to identify the circularly-permuted 
protein. The invention further discloses methods of expressing and uses of 
permuteins. 

Background of the invention 
Protein permutagenesis 

Circularly permuted proteins are made by reordering the primary sequence 
of a parent protein. The amino and carboxy terminal ends of the parent protein 
are joined by a peptide linker and new amino and carboxy terminal ends are 
generated at other positions in the sequence. This technique of generating 
variants has been applied to a wide variety of proteins (Table 1). 

Circularly permuted proteins, in many cases, are structurally and 
functionally similar to their non-permuted parent molecule after they undergo 
refolding. The information necessary to direct the folding of proteins into tertiary 
structures is present in secondary structural domains. Vectorial folding of proteins 
from their native amino to carboxy ends is not often observed. The ability of 
permuteins to retain structural and functional properties is remarkable, extending 
earlier observations on the plasticity of proteins with respect to amino acid 
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substitutions (Olins P.O. et aL, J. Biol. Chem. 270: 23754-23760, 1995; Lowman 
and Wells, J. Mol. Biol. 234: 564-578, 1993) and short amino acid insertions 
(Sondek. J. and D. Shortle, Proteins 7: 387-393, 1990; Shortle, D. and J. Sondek, 
Curr. Opin. Biotechnol. 6: 299-305). 

5 Protein sequence reorganization 

Rearrangements of DNA sequences serve an important role in evolution by 
generating a diversity of new proteins differing in structure and function. Gene 
duplication and exon shuffling, for example, generate diversity and provide 
organisms with a competitive advantage since the basal mutation rate is low 
10 (Doolittle, Protein Science 1: 191-200, 1992). 

Recombinant DNA methods have facilitated studies on the effect of 
sequence transposition on protein folding, structure, and function. The first 
rearrangement of proteins using this approach was described by Goldenberg and 
Creighton (J. Mol. Biol. 165:407-413, 1983). A new N-terminus is selected at an 
15 internal site (breakpoint) of the original sequence, the new sequence having the 

same order of amino acids as the original from the breakpoint until it reaches an 
amino acid that is at or near the original C-terminus. At this point the new 
sequence is joined, either directly or through an additional portion of sequence 
(linker), to an amino arid that is at or near the original N-terminus, and the new 
20 sequence continues with the same sequence as the original until it reaches a point 

that is at or near the amino acid that was N-terminal to the breakpoint site of the 
original sequence, this residue forming the new C-terminus of the chain. Similar 
approaches have also been used in other studies (Cunningham et al., Proc. Natl. 
Acad. Sci. U.SJL 76:3218-3222, 1979; Teather & Erfle, J. Bacterial. 172: 3837- 
25 3841, 1990; Schimming et al., Eur. J. Biochem. 204: 13-19, 1992; Yamiuchi and 

Minamikawa, FEBS Lett. 260:127-130, 1991: MacGregor et al., FEBS Lett. 
378:263-266, 1996). 

These general approaches have been applied to proteins which range in 
size from 58 to 462 amino acids (Goldenberg & Creighton, J. Mol. Biol. 165:407- 

30 413, 1983; Li & Coffino, Mol. Cell. Biol. 13:2377-2383, 1993). The proteins 

represent a broad range of structural classes, including proteins that contain 
predominantly alpha helix (interleukin-4; Kreitman et al., Cytokine 7:311-318, 
1995), beta sheet (interieukin-1; Horfcck et al., Protein Eng. 5:427-431, 1992), or 
mixtures of the two types of secondary structures (yeast phosphoribosyl 

35 anthranilate isomerase; Luger et al., Science 243:206-210, 1989). 
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Although broad categories of protein function are represented in these 
sequence reorganization studies, the results of these studies have been highly 
variable. In many cases substantially lower activity, solubility, or thermodynamic 
stability were observed (E. coli dihydrofolate reductase, aspartate 
5 transcarbamoylase, phosphoribosyl anthranilate isomerase, glyceraldehyde-3- 

phosphate dehydrogenase, ornithine decarboxylase, ompA, yeast phosphoglycerate 
dehydrogenase). In other cases, the sequence rearranged protein appeared to have ~ 
many nearly identical properties as its natural counterpart (basic pancreatic 
trypsin inhibitor, T4 lysozyme, ribonuclease Tl, Bacillus 0-glucanase, interleukin- 

10 IP, a-spectrin SH3 domain, pepsinogen, interleukin-4). In exceptional cases, an 

unexpected improvement over some properties of the natural sequence was 
observed, e.g., the solubility and refolding rate for rearranged a-spectrin SH3 
dom ain sequences, an d the receptor affinity and anti-tumor activity of transposed 
interleukin -4-Pseudomonas exotoxin fusion molecule (Kreitman et aL, Proc. Natl. 

15 Acad. Sci. U.SJL 91:6889-6893, 1994; Kreitman et aL, Cancer Res. 55:3357-3363, 

1995). 

The primary motivation for reorganization studies has been to study the 
role of short-range and long-range interactions in protein folding and stability. 
Sequence rearrangements of this type convert a subset of interactions that are 

20 long-range in the original sequence into short-range interactions in the new 

sequence, and vice versa. The fact that many of these sequence rearrangements 
are able to attain a conformation with at least some activity is persuasive evidence 
that protein folding occurs by multiple folding pathways (Viguera et aL, J. Mol. 
Biol. 247:670-681, 1995). In the case of the SH3 domain of alpha-spectrin, 

25 choosing new termini at locations that corresponded to beta hairpin turns resulted 

in proteins with slightly less stability, but which were nevertheless able to fold. 

The positions of the internal breakpoints used in the studies cited above 
are found exclusively on the surface of proteins, and are distributed throughout the 
linear sequence without any obvious bias towards the ends or the middle (the 

30 variation in the relative distance from the original N-terminus to the breakpoint is 

ca. 10 to 80% of the total sequence length). The linkers connecting the original N- 
and C-termini in these studies have ranged from 0 to 9 residues. In one case (Yang 
8l Schachman, Proc. Natl Acad. ScL UJS.A. 90:11980-11984, 1993), a portion of 
sequence has been deleted from the original C-terminal segment, and the 

35 connection made from the truncated C- terminus to the original N-terminus. 

Flexible hydrophilic residues such as Gly and Ser are frequently used in the 
linkers. Viguera et bL(J. Mol. Biol. 247:670-681, 1995) compared joining the 



WO 00/18905 PCT/US99/20891 

4 

original N- and C- termini with 3- or 4-residue linkers; the 3-residue linker was 
less thermodynamically stable. Protasova et al. (Protein Eng. 7:1373-1377, 1994) 
used 3- or 5-residue linkers in connecting the original N-termini of E. coli 
dihydrofolate reductase; only the 3-residue linker produced protein in good yield. 

Protein permutagenesis can be used to optimize the activity of fusion 
proteins or proteins conjugated to other molecules. A fusion between interleukin-4 _ 
(IL-4) and Pseudomonas exotoxin has been permuted resulting in a protein that 
has the first amino acid of the IL-4 domain at position 38 and the new carboxy end 
occurs at amino arid position 37 (Kreitman, R. J. et al., Proc Natl Acad Sci USA 91: 
6889-6893, 1994). The permuted fusion has increased affinity for the IL-4 receptor, 
increased cytotoxicity to IL-4 receptor bearing renal carcinoma cells, and increased 
anti-tumor activity in a murine model, compared to the non-permuted parent 
fusion protein (Kreitman, R. J. et al., Proc Natl Acad Sci USA 91: 6889-6893, 1994; 
Kreitman, R. J. et al., Cancer Res. 55:3357-3363, 1995; Puri, R. K. et al., Cellular 
Immunol. 171: 80-86, 1996). Increased potency of the permuted molecule is 
believed to result from a reduction in steric interference between the IL-4 domain 
in the parent molecule and its receptor. 

Steric hindrance is likely to be a concern for other chimeric proteins which 
interact with receptors through a relatively large area of their surface. The same 
issue also arises with bioconjugates, containing relatively small chemicals 
conjugated to proteins or other molecules in complex polymers (Rose, K et al., 
Molecular Immunology 32: 1031-1037, 1995). 

Phage display methods 

Display methods allow affinity selection of protein variants from a library 
of displayed proteins or peptides (Clackson, T. and J A. Wells, Tibtech 12: 173-184, 
1994; Winter, G., Drug Development Res. 33: 71-89, 1994). Many biological 
entities can be used in display methodologies (so-called "genetic packages" for 
presentation, including bacterial and eukaryotic cells, various eukaryotic and 
prokaryotic viruses, and spores), but the most commonly used vehicles used for 
) display are filamentous bacteriophage, as used herein. We envision the possibility 

that a genetic package other than the particular phage used here could be used to 
present libraries of permuteins, and if so, constitute essentially the same 
invention. 
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Foreign proteins are presented on the surface of a phage particle, and the 
gene encoding the foreign protein is encapsulated in the virion. Because they are 
linked by the phage particle, affinity isolation of the presented protein also leads to 
affinity isolation of the corresponding genes. Extremely large libraries of phage 
5 presented proteins are constructed and affinity screened very rapidly. From the 

standpoint of how quickly mutant proteins can be made and screened for activity, 
phage display is the most efficient mutagenesis technique currently available. 

Functional properties of permuteins 

Permuteins can have improved biological properties by acting through , 
10 several mechanisms. The permutein acting on the same type of cell as its parent 

molecule, may have increased binding, or other action, by virtue of increased 
avidity. Dimers or higher order multimers of these proteins with themselves or 
other chemical groups, including proteins, can have increased efficacy or potency, 
or both. 

15 Permuteins can also have improved therapeutic properties through a 

variety of mechanisms such as: (1) alterations in the overall on- or off-rates or K. or 
K„ of the ligand(s) on the target cell; (2) activation or blockade of complementary 
receptor signaling pathways; and/or (3) more specific targeting of to the cell of 
interest. The permuteins may also possess a unique pharmacokinetic distribution 

20 % and clearance profile (Dehmer et al., Circulation, 91, 2188-2194, 1995; Tanaka et 
al.,. Nature Medicine, 3, 437-442, 1997). 

Permuteins can also have improved properties in vivo, compared to the two 
components individually, as a result of alterations in biodistribution or half-life. 
The improved properties can also result from the binding of the permutein to one 
25 or more of the receptors, pharmacokinetics, or uptake of the permutein is altered in., 

a favorable manner. 

Molecular biology approaches have traditionally been used to permute 
proteins (Horlick, RA. et al., Protein Engineering 5: 427r431, 1992) although 
.. chemical approaches have been used to make small permuted proteins 
30 (Goldenberg, D. P. and T. E. Creighton, J. Mol. Biol. 165: 407-413, 1983). These 

approaches are relatively labor intensive, limiting the number of permuteins that 
can be generated and efficiently screened for the desired biological activities. 
Rapid methods of generating permuteins, coupled with efficient methods for 
screening are needed that will result in the identification of novel active molecules. 
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Summary of the invention 

The present invention is an improved method for generating permuteins 
(scanning permutagenesis) based on the display of proteins on bacteriophage 
surface proteins. Phage display is a powerful, yet convenient tool, traditionally 
5 used for mutagenesis and screening (Clackson, T. and JA. Wells, Tibtech 12: 173- 

184, 1994). Improvements to this technology allow the rapid generation and - 
screening of libraries of permuteins. Variables, such as position of the new termini 
and the length and composition of peptide linkers can easily be varied to generate 
libraries of the desired diversity. 

L0 The present invention relates to methods of producing biologically-active 

circularly permuted proteins of the formula C'-L'-N 1 , derived from a parent protein 
of the formula N'-C 1 , wherein C l is comprised of a segment derived from the 
carboxy portion of said parent protein; N 1 is comprised of a segment derived from 
the amino terminal portion of said parent protein; and L 1 is a chemical bond or a 

15 linker, linkin g C 1 to the amino terminus of L 1 and carboxy terminus of L 1 to the 

amino terminus of N 1 ; comprising the steps of: (a) making a series of circularly- 
permuted genes; (b) inserting said circularly-permuted genes into a display vector; 
(c) expressing said circularly-permuted genes such that the proteins encoded by 
said genes are presented on the surface of the display vector; (d) generating a 

20 library of display vectors presenting the expressed circularly permuted protein; (e) 

affinity-select the presenting display vectors with a target protein that can bind a 
biologically-active circularly-permuted protein; (£) isolate and analyze clones of 
selected display vectors to identify the presented circularly-permuted protein. 

Preferably the method of making a series of circularly-permuted genes is 
25 selected from the group consisting of making a tandemly-repeated intermediate, 

total synthesis of a synthetic gene, assembly of a gene from synthetic 
oligonucleotides, DNA amplification, and limited digestion of a circular 
intermediate. 

Preferably, the display vector is selected from the group consisting of 
30 bacteriophage display vectors, bacteria, and baculovirus vectors. Even more 

preferably the presentation vector is a bacteriophage. Even more preferably, the 
presentation vector is bacteriophage M13. Most preferably, the presentation vector 
is a bacteriophage M13 gene III vector. 
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Preferably the method of making a series of circularly permuted genes is a 
method of making a tandem repeat intermediate. Even more preferably circularly 
permuted genes are amplified from the repeat by gene amplification. 

Preferably the method of affinity selection comprises the steps consisting of 
5 (a) binding said presentation display vectors to a target protein; (b) eluting said 

display vectors; (c) amplifying said display vectors; and (d) biopanning a pool of _ 
said amplified display vectors. 

Preferably, the length of C 1 in the permutein is longer than the length of C 1 
in said parent protein. More preferably, the length of C 1 in the permutein is 
10 shorter than the length of C 1 in said parent protein. Most preferably, the length of 

C 1 in the permutein is the same length as the length of C 1 in said parent protein. 

Preferably, the length of N 1 in the permutein is longer than the length of 
N l in said parent protein. More preferably, the length of N 1 in the permutein is 
shorter than the length of N 1 in said parent protein. Most preferably, the length of 
15 N l in the permutein is the same length as the length of N 1 in said parent protein. 

The invention also contemplates circularly permuted proteins of the 
formula C'-L^N 1 made by the method of scanning permutagenesis. Preferably, the 
DNA sequence encoding said linker L 1 is selected from the group consisting of SEQ 
ID NO: 1 through SEQ ID NO: 368. 

20 Preferably, the circularly-permuted protein is the G-CSF receptor agonist 

domain of a species of mylepoietin (MPO). MPO is one member of a family of novel 
dual cytokine receptor agonists (McKearn, J.P., Myelorestorative activities of 
synthokine and myelopoietin. In Proceedings of the 1996 IBC Conference on 
Therapeutic Applications of Cytokines, pp. 4.3.1-4.3.18, 1996) which are amenable 

25 to manipulation by phage display (Merlin, S. et al., Applied Biochemistry and 

Biotechnology 67: 15-29, 1997; Lee, S.C., Phage presentation for cytokine 
engineering. IBC's Second International Conference on Display Technologies, 
1997). 

Brief description of the figures 
30 Figure 1. Schematic depiction of scanning permutagenesis 

Plate A of Figure 1 shows the strategy to generate a scanning 
pemiutagenesis phage display library. A plasmid containing directly-repeated 
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tandem copies of the hG-CSF gene, for example, is constructed by standard 
methods. The tandem repeat plasmid is used as "the template for PCR 
amplification of genes encoding permuted proteins. Each copy of the G-CSF^ene 
is indicated in light gray (turquoise), and a DNA segment encoding a peptide linker 
5 is indicated in dark gray (red). 

In individual PCR reactions, oligonucleotide primers that initiate PCR 
polymerization at the first nucleotide of a chosen codon of G-CSF, and directing 
polymerization to the end of the tandem construct specifying the carboxy end of the 
protein encoded on the template is annealed to the tandem template. Also, a 

10 second specific primer is also annealed to the template that initiates 

polymerization at the last nucleotide of the codon encoding the amino acid 
immediately preceding the codon where polymerization begins with first primer, 
and which directs polymerization in the opposite direction from that first primer. 
Amplification between these two primers produces a DNA segment encoding a 

15 permuted protein. For example, amplification between the primer indicated by a 

black arrow initiating at codon 2 and the primer indicated by the blue arrow and 
initiating at the codon before 2 (codon 1) produces an amplified gene encoding a 
permuted protein whose amino terminal residue is amino acid 2 of the native 
protein, and whose final amino acid is amino acid 1 of the native protein. 

20 A linker peptide is present between the first and final amino acids of the 

parent protein (residues 1 and 174 in this example). A total of 174 individual 
amplifications would produce a complete collection of all permuted proteins of this 
example. More limited collections containing only a selected set of permuteins can 
be made, as well as more extensive collections made from multiple tandem 

25 template plasmids, each containing a different linker sequence between the first 

and last residues of the two directly repeated tandem gene sequences. The 
collection of amplified segments can then be inserted into a phagemid presentation 
vector by standard methods. Phagemid particles produced from these presentation 
constructs are the scanning permutagenesis phage display library. 

30 Plate B of Figure 1 shows the affinity screening of a phage display library 

(See Clackson, T. and JA. Wells, Tibtech 12: 173-184, 1994; Winter, G., Drug 
Development Res. 33: 71-89, 1994). In this example, a hG-CSF scanning 
permutagenesis library as described in Figure 1A is screened using the hG-CSF 
receptor expressed on mammalian cells as the affinity reagent. In Figure IB, 

35 individual presented proteins are indicated by the shaded circles or diamonds and 

the affinity reagent is indicated by the light gray (pink) rectangles. Presentation 
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library particles are exposed to affinity reagent, unbound particles are washed 
away, and receptor-bound particles are eluted. The eluted particles are amplified 
in E. coli y and the screening cycle is repeated. During any round of the screening 
cycle, the genes encoded (in the present example encoding permuted proteins) by 
5 the selected particles can be expressed and evaluated. 

Figure 2- Pennuteins presented in the scanning pennutagenesis library _ 

Human G-CSF (serl7) protein is depicted as a string of circles, each circle 
corresponding to a single amino acid residue. Amino and carboxy ends of the 
protein are indicated. The amino acids of helical regions are indicated by medium 

10 gray balls, while the amino acids of inter-helical loops are indicated in light gray 

balls (See Hill et aL, Proc. Natl Acad. ScL USA 90: 6167-5171, 1993). Amino ends 
of the pennuteins made for presentation in the library are indicated in dark gray. 
Asterisks indicate the breakpoints of the presented pennuteins which were 
isolated by affinity screening with cells expressing hG-CSF receptor as illustrated 

15 in IB. 

Figure 3. Bioactivity.of pennuteins identified by affinity screening of the 
scanning pennutagenesis library 

Individual pennuteins were expressed transiently in mammalian cells. 
Permeation molecules in the culture supernatants were quantitated by ELISA, and 
20 the proliferative activity of clones was determined using BAF-3-cells dependent on 

G-CSF for growth. The horizontal axis indicate concentration of protein and the 
vertical axis indicate incorporation of tritiated thymidine. 

Definitions k 

The following is a list of abbreviations and the corresponding meanings as 
25 used interchangeably herein: 

g = gram(s) 

mg = milligram(s) 

ml and mL = milliliters) 

RT = room temperature 
30 ug and pg = microgram(s) 

uL and pi = microliters) 



The following is a list of definitions of various terms used herein: 
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The term "permutein" means a circularly-permuted protein: a protein in 
which the amino and carboxy ends of the parent protein are joined together by a 
peptide linker sequence of zero or more amino acids. The amino and carboxy ends 
of the permuted protein occur at amino acids within the parental sequence. 

5 The terms "chemical ligation" and "conjugation" mean a chemical reaction 

which covalently links two similar or dissimilar functional groups together _ 
intramolecularly or intermolecularly. 

The term "peptide linker' means a compound which forms a carboxamide 
bond between two groups having one or more peptide linkages <CONH-) and serves 
10 as a connector for the propose of amelioration of the distance or space orientation 

between two molecules. 

The term "native sequence" refers to an amino acid or nucleic acid sequence 
which is identical to a wild-type or native form of a gene or protein. 

The terms "mutant amino acid sequence," "mutant protein", "variant 
15 protein", "mutein", or "mutant polypeptide" refer to a polypeptide having an amino 

acid sequence which varies from a native sequence due to amino acid additions, 
deletions, substitutions, or all three, or is encoded by a nucleotide sequence from 
an intentionally-made variant derived from a native sequence. 

Detailed description of the invention 
20 Determination of the amino and carboxyl termini of permuteins 

The present invention encompasses circularly permuted-proteins of the 
formula C'-L'-N 1 prepared by phage display techniques. The polypeptide can be 
joined either directly or through a linker segment. The term "directly" defines 
permuteins in which the polypeptide ends are joined without a linker. Thus L 1 

25 represents a chemical bond or a linker, preferably a polypeptide segment to which 

both C 1 and N 1 are joined, wherein C* is comprised of a segment derived from the 
carboxy portion of the parent protein and N 1 is comprised of a segment derived 
from the amino terminal portion of a parent protein represented by the general 
formula N'-C 1 . Preferably, N' and C 1 in the permuted protein C'-L'-N' are the 

30 same length as in the parent protein N'-C\ but each may be independently shorter 

or longer depending on the desired structural characteristics of the permutein. 
Most commonly L 1 is a linear peptide in which C 1 and N 1 are joined by amide 
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bonds, linking C 1 to the amino terminus of L 1 and carboxy terminus of L 1 to the 
amino terminus of N\ 

Additional peptide sequences may also be added to facilitate purification or 
identification of permuteins (e.g., poly-His). A highly antigenic peptide may also be 
5 added that would enable rapid assay and facile purification of the permuteins by a 

specific monoclonal antibody. 

Determination of the linker 

The linking group (L. 1 ) is generally a polypeptide of between 1 and ~500 
amino acids in length. The linkers joining the two molecules are preferably 

10 designed to (1) allow the two molecules to fold and act independently of each other, 

(2) not have a propensity for developing an ordered secondary structure which 
could interfere with the functional domains of the two proteins, <3) have minimal 
hydrophobic characteristics which could interact with the functional protein 
domains and (4) provide steric separation of C l and N 1 such that C 1 and N 1 could 

15 interact simultaneously with their corresponding receptors on a single cell. 

Typically surface amino acids in flexible protein regions include Gly, Asn and Ser. 
Virtually any permutation of amino acid sequences containing Gly, Asn and Ser 
would be expected to satisfy the above criteria for a linker sequence. Other neutral 
amino acids, such as Thr and Ala, may also be used in the linker sequence. 

20 Additional amino acids may also be included in the linkers due to the addition of 

unique restriction sites in the linker sequence to facilitate construction of the 
multi-functional proteins. 

Preferred L 1 linkers of the present invention include sequences selected 
from the group of formulas: 

25 (SEQ ID NO: 1) through SEQ ID NO: 268) 

Other linkers are also contemplated by the invention. The present 
invention is, however, not limited by the form, size or number of linker sequences 
employed. The only requirement of the linker is that it does not functionally 
interfere with the folding and function of the individual molecules of the multi- 
30 functional protein. 

Utility of permuteins 

Permuteins of the present invention may exhibit useful properties such as 
having similar or greater biological activity when compared to a single factor or by 
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having improved half-life or decreased adverse side effects, or a combination of 
these properties. 

Permuteins which have little or no activity maybe useful as antigens for 
the production of antibodies for use in immunology or immunotherapy, as probes or 
as intermediates used to construct other useful permuteins. 

The permuteins of the present invention may have an improved 
therapeutic profile as compared to their parent molecules. For example, some 
permuteins of the present invention may have a similar or more potent activities 
relative to other compounds or proteins without having a similar or corresponding 
increase in side-effects. This is particularly true of multifunctional or fusion 
protein therapeutics, where permutation may relieve steric and other hindrances 
that impair the activity of the parent fusion molecules <see Kreitman, R. J. et aL, 
Proc Natl Acad Sci USA 91: 6889-6893, 1994; Kreitman, R. J. et al., Cancer Res. 
55:3357-3363, 1995, for examples). 

A general utility of permuteins is in the area of nanoscale devices described 
alternatively as "nanobiologicaT or "nanobiotechnological." These are nanoscale 
devices containing both precise structure nanomaterials and biological functional 
components (such as proteins). Nanodevices have been the subject of several 
reviews (Lee, S.C., Trends in Biotechnology, 16: 239-240, 1998). 

Nanobiological/nanobiotechnological devices generally contain proteins 
covalently coupled to polymers or other non-biological precise structure materials. 
Issues of steric and other interferences with protein activity are applicable to 
proteins in nanobiological/nanobiotechnological devices and are highly analogous 
to the issues with multifunctional/fusion proteins discussed above. Protein 
permutation is fully expected to offer a viable approach to deal with these 
considerations, just as it does in the case of fusion proteins (Kreitman, R. J. et al., 
Proc Natl Acad Sci USA 91: 6889-6893, 1994; Kreitman, R. J. et al.,., Cancer Res. 
55:3357-3363,1995). 

Examples 

The following examples will illustrate the invention in greater detail, 
although it will be understood that the invention is not limited to these specific 
examples. Various other examples will be apparent to the person skilled in the art 
after reading the present disclosure without departing from the spirit and scope of 
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the invention. It is intended that all such other examples be included within the 
scope of the appended claims. 

General Materials and Methods 

General methods of cloning, expressing, and characterizing proteins are 
5 found in T. Maniatis et aL, Molecular Cloning, A Laboratory Manual, Cold Spring 

Harbor Laboratory, 1982, and references cited therein, incorporated herein by 
reference; and in J. Sambrook et aL, Molecular Cloning, A Laboratory Manual, 2 nd 
edition, Cold Spring Harbor Laboratory, 1989, and references cited therein, 
incorporated herein by reference. 

10 Unless noted otherwise, all specialty chemicals were obtained from Sigma 

Chemical Company (St. Louis, MO). Restriction endonucleases and T4 DNA ligase 
were obtained from New England Biolabs (Beverly, MA) or Boehringer Mannheim 
(Indianapolis, IN). 

Strains, plasmids, and bacteriophage 

15 The bacterial strains used in these studies are listed in Table 1. Plasmids 

and bacteriophage used or constructed in this study are listed in Tables 2 and 3, 
respectively. 

Phage and phagemid stocks were made and manipulated as described 
(Kay, B.K, Winter, J., and McCafferty, J., Phage Display of Peptides and Proteins, 
20 Academic Press, San Diego, California, 1996; Merlin, S. et aL, Applied 

Biochemistry and Biotechnology 67: 15-29, 1997). 

Transformation of K coli strains 

E, coli strains (Table 1), such as DHScc™ (Life Technologies, Gaithersburg, 
MD) and TGI (Amersham Corp., Arlington Heights, IL) are used for 
25 transformation of ligation reactions and are. the hosts used to prepare plasmid 

DNA for transfecting mammalian cells. E. coli strains, such as JM101 (Yanisch- 
Perron et aL, Gene, 33: 103-119, 1985) and MON105 (Obukowicz et aL, AppL and 
Envir. Micr. f 58: 1511-1523, 1992) can be used for expressing the multi-functional 
proteins of the present invention in the cytoplasm or periplasmic space. 

30 DHSct™ Subcloning efficiency cells are purchased as competent cells and 

are ready for transformation using the manufacturers protocol, while both E. coli 
strains TGl and MON105 are rendered competent to take up DNA using a CaCl 2 
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method. Typically, 20 to 50 mL of cells are grown in LB medium (1% Bacto- 
tryptone, 0.5% Bacto-yeast extract, 150 mM NaCl) to a "density of approximately 
1.0 optical density unit at 600 nanometers (OD600) as measured by a Baush & 
Lomb Spectronic spectrophotometer (Rochester, NY). The cells are collected by 

5 centrifugation and resuspended in one-fifth culture volume of CaCL, solution (50 

mM CaCl,, 10 mM Tris-Cl, pH7.4) and are held at 4°C for 30 minutes. The cells are 
again collected by centrifugation and resuspended in one-tenth culture volume of - 
CaCl, solution. Iigated DNA is added to 0.2 mL of these cells, and the samples are 
held at 4°C for 30-60 minutes. The samples are shifted to 42°C for two minutes 

10 and 1.0 mL of LB is added prior to shaking the samples at 37°C for one hour. Cells 

from these samples are spread on plates (LB medium plus 1.5% Bacto-agar) 
containing either ampicillin (100 micrograms/mL, ug/mL) when selecting for 
ampicillin-resistant transformants, or spectinomycin (75 ug/mL) when selecting for 
spectinomycin-resistant transformants. The plates are incubated overnight at 

15 37°C. 

Colonies are picked and inoculated into LB plus appropriate antibiotic (100 
ug/mL ampicillin or 75 ug/mL spectinomycin) and are grown at 37°C while 
shaking. 

DNA isolation and characterization 

20 DNA constructs were made and propagated in E. coli using standard 

molecular biology techniques (Sambrook, J. et aL, Molecular Cloning, A Laboratory 
Manual, 2 nd edition, Cold Spring Harbor Laboratory, 1989). 

Plasmid DNA can be isolated by a number of different methods and using 
commercially available kits known to those skilled in the art. Plasmid DNA is 

25 isolated using the Promega Wizard™ Miniprep kit (Madison, WI), the Qiagen 

QIAwelt Piasraid isolation kits (Chatsworth, CA) or Qiagen Plasmid Midi or Mini 
kit. These kits follow the same general procedure for plasmid DNA isolation. 
Briefly, cells are pelleted by centrifugation (5000 x g), the plasmid DNA released 
with, sequential NaOH/acid treatment, and cellular debris is removed by 

30 centrifugation (10000 x g). The supernatant (containing the plasmid DNA) is 

loaded onto a column containing a DNA-binding resin, the column is washed, and 
plasmid DNA eluted. After screening for the colonies with the plasmid of interest, 
the E. coli cells are inoculated into 50-100 ml of LB plus appropriate antibiotic for 
overnight growth at 37°C in an air incubator while shaking. The purified plasmid 

35 DNA is used for DNA sequencing, further restriction enzyme digestion, additional 
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subcloning of DNA fragments and transfection into 22. coli, mammalian cells, or 
other cell types. 

Sequence confirmation 

DNA sequence analysis was performed using the Genesis -2000 DNA 
analysis system using standard methods (Prober et al., Science 238: 336-341, 
1987). 

Purified plasmid DNA is resuspended in dHjO and its concentration is 
determined by measuring the absorbance at 260/280 nm in a Bausch and Lomb 
Spectronic 601 UV spectrometer. DNA samples are sequenced using ABI PRISM™ 
DyeDeoxy™ terminator sequencing chemistry (Applied Biosystems Division of 
Perkin Elmer Corporation, Lincoln City, CA) kits (Part Number 401388 or 402078) 
according to the manufacturer's suggested protocol usually modified by the 
addition of 5% DMSO to the sequencing mixture. Sequencing reactions are 
performed in a DNA thermal cycler (Perkin Elmer Corporation, Norwalk, CT) 
following the recommended amplification conditions. Samples are purified to 
remove excess dye terminators with Centri-Sep™ spin columns (Princeton 
Separations, Adelphia, NJ) and lyophilized. Fluorescent dye labeled sequencing 
reactions are resuspended in deionized formamide, and sequenced on denaturing 
4.75% polyacrylamide-8M urea gels using ABI Model 373A and Model 377 
automated DNA sequencers. Overlapping DNA sequence fragments are analyzed 
and assembled into master DNA contigs using Sequencher DNA analysis software 
(Gene Codes Corporation, Ann Arbor, MI). 

Expression of permuted proteins in mammalian cells 

To obtain sufficient protein for analysis of the bioactivity of individual 
MPO molecuJes containing cphG-CSFs, DNA segments containing individual 
affinity-selected MPO: cphGCSFs were subcloned into a mammalian expression 
vector, and expressed transiently in BHK cells as described below. 

The BHK-21 cell line can be obtained from the ATCC (Rockville, MD). The 
cells are cultured in Dulbecco's modified Eagle media (DMEM/high-glucose), 
supplemented to 2 mM (mM) L-glutamine and 10% fetal bovine serum (FBS). This 
formulation is designated BHK growth media. Selective media is BHK growth 
media supplemented with 453 units/mL hygromycin B (CalBiochem, San Diego, 
CA). The BHK-21 cell line was previously stably transfected with the HSV 
transactivating protein VP16. which transactivates the 1E110 promoter found on 
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the plasmid pMON3359 and pMON3633 and the IE 175 promoter found in the 
plasmid pMON3360B (Hippenmeyer, P.J. and Pegg, L.E., Cum Opin. BiotechnoL 
6: 548-552, 1995). The VP16 protein drives expression of -genes inserted behind the 
IE110 or IE175 promoter. BHK-21 cells expressing the transactivating protein 
5 VP16 are designated BHK-VP16. The plasmid pMON1118 expresses the 

hygromycin resistance gene from the SV40 promoter (Highkin et al., Poultry ScL, 
70: 970-981, 1991). A similar plasmid, pSV2-hph, is available from ATCC. 

BHK-VP16 cells are seeded into a 60 millimeter (mm) tissue culture dish at 
3 X 10 5 cells per dish 24 hours prior to transfection. Cells are transfected for 16 

10 hours in 3 mL of "OPTIMEM"™ (Gibco-BRL, Gaithersburg, MD) containing 10 ug 

of plasmid DNA containing the gene of interest, 3 ug hygromycin resistance 
plasmid, pMON1118, and 80 ug of Gibco-BRL "LIPOFECTAMINE w ™ per dish. 
The media is subsequently aspirated and replaced with 3 mL of growth media. At 
48 hours post-transfection, media from each dish is collected and assayed for 

15 activity (transient conditioned media). The cells are removed from the dish by 

trypsin-EDTA, diluted 1:10, and transferred to 100 mm tissue culture dishes 
con taining 10 mL of selective media. After approximately 7 days in selective 
media, resistant cells grow into colonies several millimeters in diameter. The 
colonies are removed from the dish with filter paper (cut to approximately the 

20 same size as the colonies and soaked in trypsin/EDTA) and transferred to 

individual wells of a 24 well plate containing 1 mL of selective media. After the 
clones are grown to confluence, the conditioned media is re-assayed, and positive 
clones are expanded into growth media. 

Affinity selection and screening of phagemids 

25 Affinity reagent used for the identification of functional MPO molecules 

containing cphG-CSF (MPO: cphG-CSF) species from the library were BHK cells 
expressing the hG-CSF receptor on their surface. The library pool was subjected to 
iterative affinity selection (four rounds) against BHK cells expressing the h-GCSF 
receptor using previously described techniques (Merlin, S. et al., Applied 

30 Biochemistry and Biotechnology 67: 15-29, 1997). Between rounds of selection, 

phage eluted from the affinity reagent were amplified in E. coli (Kay, BJL J. 
Winter, and J. McCofferty, Phage Display of Peptides and Proteins, Academic 
Press, San Diego, California, 1996). 
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Expression of proteins in E. coli 

When large-scale quantities of recombinant protein are desirable for 
structure-function studies, DNA segments containing individual affinity-selected 
MPO:cphGCSFs are subcloned into any of a variety of bacterial plasmid expression 
5 vectors, and expressed as a cytoplasmic product or as a secreted protein in E. colL 

E. coli strain MON105 or JM101 harboring the plasmid of interest are 
grown at 37°C in M9 plus casamino acids medium with shaking in an air incubator 
Model G25 from New Brunswick Scientific (Edison, NJ). Growth is monitored at 
OD^o until it reaches a value of 1.0 at which time nalidixic acid (10 mg/mL) in 0.1 

10 N NaOH is added to a final concentration of 50 ug/mL, for cultures containing 

plasmids with the E. coli recA promoter driving expression of the recombinant 
gene. IPTG is used in place of nalidixic acid, as a chemical inducer to facilitate 
expression from plasmids containing the lac promoter or hybrid lac promoters. The 
cultures are then shaken at 37°C for three to four additional hours. A high degree 

15 of aeration is maintained throughout the culture period in order to achieve 

maximal production of the desired gene product. The cells are examined under a 
light microscope for the presence of inclusion bodies <IB). One mL aliquots of the 
culture are removed for analysis of protein content by boiling the pelleted cells, 
treating them with reducing buffer and electrophoresis via SDS-PAGE (see 

20 Maniatis et al., "Molecular Cloning: A Laboratory Manual", 1982). The culture is 

% centrifuged (5000 x g) to pellet the cells. 

Isolation of Inclusion Bodies 

The cell pellet from a 330 mL E. coli culture is resuspended in 15 mL of 
sonication buffer (10 mM 2-amino-2-(hydroxymethyl) 1,3-propanediol 

25 hydrochloride (Tris-HCl), pH 8.0 + 1 mM ethylenediaminetetraacetic acid (EDTA). 

These resuspended cells are sonicated using the microtip probe of a Sonicator Cell 
Disruptor (Model W-375, Heat Systems-Ultrasonics, Inc., Farmingdale, New York). 
Three rounds of sonication in sonication buffer followed by centrifugation are 
employed to disrupt the cells and wash the inclusion bodies (IB). The first round of 

30 sonication is a 3 minute burst followed by a 1 minute burst, and the final two 

rounds of sonication are for 1 minute each. 

Purification 

The folded proteins can be affinity-purified using affinity reagents such as 
monoclonal antibodies or receptor subunits attached to a suitable matrix. 
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Purification can also be accomplished using any of a variety of chromatographic 
methods such as: ion exchange, gel filtration or hydrophobic chromatography or 
reversed phase HPLC. These and other protein purification methods are described 
in detail (Methods in Enzymology, Volume 182 "Guide to Protein Purification- 
edited by Murray Deutscher, Academic Press, San Diego, California, 1990). 

Protein Characterization - 

The purified protein is analyzed by RP-HPLC, electrospray mass 
spectrometry, and SDS-PAGE. The protein quantitation is done by amino acid 
composition, RP-HPLC, and Bradford protein determination. In some cases tryptic 
peptide mapping is performed in conjunction with electrospray mass spectrometry 
to confirm the identity of the protein. 

Baf-3/G-CSF receptor assay 

Briefly, the mouse lymphoid cell line BafB was transfected with human 
granulocyte colony stimulating factor receptor (hG-CSFR) cDNA. Stable clones of 
Baf3 which expressed the G-CSFR and proliferated in the presence of hG-CSF 
were isolated and used to investigate the activity of human G-CSF receptor 
agonists without the influence of other human cytokine receptor responses. 

The cDNA encoding hG-CSFR (a gift from Dr. Daniel C. Link (Washington 
University, St. Louis, MO) was released from the plasmid pEMCV.Sralpha as a 
HindTIUEcoBI (5" to 3') fragment, gel-purified, and inserted into the mammalian 
cell expression plasmid pcDNA3 (Invitrogen, San Diego, CA). This plasmid 
contains enhancer-promoter sequences from the immediate early gene of the 
human cytomegalovirus (CMV), a bovine growth hormone polyadenylation signal 
and transcription termination sequences, a neomycin resistance gene is present for 
the selection of G418 stable cell clones, and an ampicillin resistance gene for 
selection in E. coli. Ligation mixtures were transformed into E. coli strain TGI 
[delta (lac-pro), supE, thi, hsdD5/F(traD36, proA*B\ lad', tecZdeltaMlfi] and 
plasmid DNA was purified using a Qiagen Midiprep Plasmid Kit. The structure of 
plasmid DNAs containing hG-CSFR were confirmed by restriction enzyme analysis 
and by automated DNA sequence analysis using an ABI sequencing machine. One 
of several plasmids with the correct structure was selected and given the 
designation pMON30298. 



Baf3 cell*, maintained in complete growth medium (RPMI 1640 
supplemented to 10% FBS and 10% Wehi 3B supernatant as a source for mouse IL- 
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3), were seeded at a subconfluent cell density of 10*5 cells/ml in .growth media 
(RPMI 1640 5% FBS; 2 mM L-glutamine) the day prior to the electroporation. The 
cells were collected and rinsed twice in 10 ml serum-free RPMI 1640. The cells 
were diluted to 10*67ml in serum-free RPMI and 1 ml was placed into each 
5 electroporation chamber (Gibco/BRL #1608AJ). SO ug of plasmid DNA was added 

to each chamber and the chamber were incubated on ice for 30 minutes prior to 
electroporation. The cells were electroporated on ice at a capacitance of 800 uF, - 
400V, fast charge, and low ohms in a BRL CellPorator. The cells were immediately 
removed from the chambers and placed into 10 cm dishes containing 10 ml of 
10 growth medium. The cells were allowed to recover for 48 hr in growth media prior 

to selection. 

After the recovery period, the cells were pelleted at 1000 rpm for 10 
minutes, and resuspended into 10 ml of selection medium (growth medium 
containing 800 ug/ml G418 sulfate (Gibco/BRL). The cells were kept in selection 

15 media, being passaged twice weekly, until only a few viable cells could be seen in 

the mock transfected control cell dishes (approximately 2 weeks). After an 
additional 2 weeks in selection media, the cells which had been electroporated with 
the hG-CSFR cDNA had grown to a cell density which allowed them to be tested 
for proliferation in the presence of hG-CSF (Fukunaga, R. et al., EMBO J. 10 (10): 

20 2855-2865, 1991). 

The cell proliferation assay conditions are as follows: Briefly, 25,000 cells 
were plated in a microtiter 96 well plate with or without cytokine in IMDM 
medium supplemented with BSA (50 ug/ml), human transferrin (100 ug/ml), lipid 
(50 ug/ml) 2-mercaptoethanol (50 uM final concentration). Each well was 
25 incubated with 0.5 uCi of ^-thymidine (16 hours) and the incorporated 

radioactivity was measured. Triplicate wells containing Baf3 cells were set up 
with 4 nM hG-CSF, 4 nM mIL-3 or media only control. Samples of different 
permuted proteins were tested in each assay. 

Example 1: Construction of a permutein library without a linker region 



30 



Figure 1 shows a schematic of scanning permutagenesis. A plasmid 
construct comprising a tandem repeat of the modified human granulocyte colony 
stimulating factor (hG-CSF with a serine for amino acid 17) gene joined by a 
sequence (GCCGG, termed a zero order linker) was generated and subcloned into 
the plasmid pACYC177 (Chang, A.C.Y. and S.N. Cohen, J Bacterial. 134: 1141- 
35 1156, 1978) using standard molecular biology methods (Sambrook, J. et al., 
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Molecular Cloning, A Laboratory Manual, 2"" edition, Cold Spring Harbor Press, 
New York, 1989). The resultant plasmid construct (pMON15978) was linearized by 
restriction digestion (Smal) and used as a template for PCR amplification of 
circularly permuted hG-CSF (cphG-CSF) genes, following the method of Horlich 
(Horlick, RA. et al., Protein Engineering 5: 427-431, 1992. For purposes of this 
demonstration of the scanning permutagenesis technique, we chose to make a 
limited permutein library rather than one containing every possible cphG-CSF. 
Figure 2 shows the position of the new amino termini for each new cphG-CSF. 

Individual cphG-CSF genes were inserted into phagemid presentation 
10 vector pCANTAB 5E (Pharmacia Biotech,) such that they were expressed as a part 

of a MPO species (Feng, Y., N. R. Staten, C. M. Baum, N. L. Summers, M. Caparon, 
S. C. Bauer, L. Zurfluh, J. P. McKearn, B. K. Klein, S. C. Lee, C. A. McWherter. 
1997. Multi-functional hematopoeitic receptor agonists. World Patent Application 
WO 97/12985) which was in turn fused to the amino end of the phage gene III 
15 product. The presented fusion protein contained, starting from its amino terminus, 

a hIL-3 receptor agonist, cphG-CSF, and the phage gene HI product. The juncture 
between the presented protein and the gene III product was as previously 
described (Merlin, S. et al., Applied Biochemistry and Biotechnology 67: 15-29, 
1997). 

20 After confirmation of the structure of each phagemid construct, phagemid 

particles were produced for each individual cphG-CSF-presenting species (Merlin 
et al., 1997). Some of these lots of particles were used to individually define the 
affinity properties of specific presented cphG-CSF species in analytical biopanning 
experiments (Caparon, M. H. et al., Molecular Diversity 1: 241-246, 1996; Merlin, 

25 S. et al., Applied Biochemistry and Biotechnology 67: 15-29, 1997), but all of the 

phage particle lots were titered and equivalent numbers of transducing units of 
each particle preparation were pooled together to form the scanning 
permutagenesis library for hG-CSF in an MPO background. Figure 2 shows the 
MPO: cp hG-CSF species present in the library. 

30 mpo : cphG-CSF 38/37 is an example bf the nomenclature used to specify 

the identity of individual permuted proteins. It describes a MPO molecule 
containing a circularly permuted human G-CSF module (with the serine 17 
substitution). The first amino acid of the cphG-CSF domain is amino acid 38 of the 
parent protein, and the last amino acid is residue 37 of the parent. 

35 Example 2: Presentation and Affinity screening of the MPO: cphGCSF 
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library 

MPO: cphG-CSF 38/37, is a full hG-CSF receptor agonist (McKearn, J.P., 
Myelorestorative activities of synthokine and myelopoietin. In Proceedings of the 
1996 IBC Conference on Therapeutic Applications of Cytokines, pp. 4.3.1-4.3.18, 
1996). It was presented on filamentous phage as a positive control to demonstrate 
that permuted proteins can be presented on the surface of phage particles and — 
affinity selected. After phagemid particles were produced from this construct, they 
were subjected to analytical biop aiming using cells expressing the hG-CSF receptor 
as affinity reagent. 

Table 1 shows that phage presented MPO: cphG-CSF 38/37 was affinity 
selected by cells expressing the hG-CSF. MPO: cphGCSF 38/37-GPIII fusion was 
expressed, secreted and assembled into phagemid particles, and could be affinity 
selected by the hG-CSF receptor. Permutagenesis of a protein does not appear to 
impair its successful presentation. 

Relative to typical phage display libraries, the complexities of cp libraries 
are low, containing perhaps hundreds to thousands of individuals. The 
demonstration library here contained about 50 distinct clones, as opposed to more 
typical phage libraries containing more than 10 5 individuals (reviewed in Clackson, 
T. and J A. Wells, Tibtech 12: 173-184, 1994). 

37 randomly chosen selectants from round 1, and fewer from subsequent 
rounds (17, 11 and 14 were picked from rounds 2, 3 and 4, respectively) were 
chosen for sequence analysis. The identity of the MPO: cphG-CSFs identified in 
each round is shown in Figure 2. 

A total of 14 MPO: cphGCSF species were identified from the output of 
affinity selection (Figure 2). Most of the MPO: cphGCSF species identified from 
the library had new carboxy and amino termini in loop segments <9 of 14 
permuteins identified), rather than in clearly defined secondary structures (See 
Hill et al., 1993 for the hG-CSF structure). Five selectants had termini within 
helical domains of hG-CSF (MPO: cphG-CSFs 13/12, 19/18, 71/70, 123/122 and 
159/158). For three of these molecules (MPO: cphG-CSFs 13/12, 71/70 and 
123/122) their new ends lie at the outermost ends of helices, and therefore 
perturbation of secondary structure caused by these permuteins may be minimal. 
However, MPO: cphG-CSF 19/18 and MPO: cphG-CSF 159/158 have new termini 
well within helix 1 and helix 4 of hG-CSF, respectively. 
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These data parallel the observations of <Jraf and Schachman, who 
developed a limited DNase I digestion method for "random" permutagenesis <Graf, 
R. and H. K. Schachman, Proc Natl Acad Sci USA 93:11591-11596, 1996). They 
identified two permutein species of aspartate transcarbamoylase that introduced 

5 new amino and carboxy ends into secondary structural domains and that retained 

biological activity. In their work, the majority of permuteins introducing ends into 
secondary structures (5/7 identified) were significantly diminished in activity. In 
contrast, we found a several permuteins that introduced helical breaks retained 
activity (See Below). The method used by Graf and Schechman frequently 

10 introduces point mutations, small insertions and deletions into the permuted 

proteins, potentially complicating the analysis of the effects of permutagenesis. 

Example 3: Biological activity of MPO: cphG-CSFs selected from the cp 
phage library 

To obtain sufficient protein for analysis of the bioactivity of individual 
15 MPO molecules containing cphG-CSFs, DNA segments containing individual 

affinity-selected MPO: cphGCSFs were subcloned into a mammalian expression 
vector, and expressed transiently in BHK cells as described above. 

The MPO: cphG-CSFs isolated from biopanning were all expressed 
transiently in mammalian cells and the amount of MPO: cphG-CSF in each 
20 supernatant was determined by sandwich hIl-3 ELISA (Olins P.O. et al., «7. Biol. 

Chem. 270: 23754-23760, 1995). The quantitated supernatants were then assayed 
for G-CSF receptor agonist activity in a Baf-3/G-CSF receptor assay (Figure 3, 
Table 4). 

All but one of the transiently expressed MPO: cphG-CSF proteins exhibited 
25 G-CSF activity equivalent to or slightly better than that of the parent MPO 

molecule, including those MPO: cphG-CSFs with new carboxy and amino ends 
within helixes. The permutein encoded by pMON16021 with a breakpoint between 
positions 48 and 49 did not exhibit activity in the G-CSF-dependent proliferation 
assay. These data suggest that most of the proteins isolated from the library are 
30 competent to bind the hG-CSF receptor and produce a proliferation signal. 

All references, patents, or applications cited herein are incorporated by 
reference in their entirety, as if written herein. 
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Tables 



Table 1: Circularly permuted proteins 



Protein 



Reference 



Enzymes 
T4 lysozyme 

dihydrofolate reductase 



ribonuclease Tl 



Bacillus P-glucanase 



aspartate transcarbamoylase 



phosphoribosyl anthranilate 
isomerase 



glyceraldehyde-3-phosphate 

dehydrogenase 
ornithine decarboxylase 
yeast phosphoglycerate 

dehydrogenase 



Zhang et al. t Biochemistry 32:12311-12318 (1993); 
Zhang et al., Nature Struct. Biol. 1:434-438 (1995) 
Buchwalder et al., Biochemistry 31:1621-1630 
(1994); 

Protasova et al., Prot. Eng. 7:1373-1377 (1995) 
Mullins et al., J. Am. Chem. Soc. 116:5529-5533 
(1994); 

Garrett et al., Protein Science 5:204-211 (1996) 
Hahn et al., Proc. Natl. Acad.ScL U.SJL 91:10417- 

10421 (1994) 
Yang and Schachman, Proc. Natl. Acad. 

Transcarbamoylase Sci. U.SA. 90:11980-11984 

(1993) 

Luger et al., Science 243:206-210 (1989); 
Luger et al., Prot. Eng. 3:249-258 (1990) 
Lin et al., Protein Science 4:159-166 (1995) 
Vignais et al., Protein Science 4:994-1000 (1995) 

U & Coffino, Mol. Cell. Biol 13:2377-2383 (1993) 
Ritco-Vonsovici et al., Biochemistry 34:16543- 
16551 (1995) 



Enzyme Inhibitor 

basic pancreatic trypsin inhibitor Goldenberg & Creighton, J. Mol. Biol 165:407-413 

(1983) 



Cytokines 
interleukin-ip 
interleukin-4 



Horiick et al., Protein Eng. 5:427-431 {1992) 
Kreitman et al., Cytokine 7:311-318 (1995) 



Tyrosine Kinase Recognition 
Domain 
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a-spectrin SH3 domain 

Transmembrane Protein 
omp A 

Chimeric Protein 

inteY\eukin-4'Pseudomonas 
exotoxin fusion molecule 



Viguera et al., J. Mol. Biol. 247:670-681 (1995) 

Koebnik & Kramer, J, Mol. Biol. £50:617-626 
<1995) 

Kreitman et al., Proc. Natl. Acad. Sci. U.S-A. 
91:6889-6893X1994) 



BNSDOCID: <WO 
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Table 2: Strains 



Designation Description or Genotype 



DH5ct™ 


F,pJu80 dZacZdeltaM15, 
delta(ZocZYA-ar^F)U169, deoR, 
recAl, endAl, hsdRll (rk*,mk*), 
phoA, supE44, lambda-, thirl, 
gyrA96 y relAl 


Life Technologies, Rockville, 
Maryland 


•TM101 

(ATCC# 

33876) 


delta (pro lac), supE, tfti, 
F(*raD36, praA*B% JacP, 
ZacZdeltaM15) 


Yanisch-Perron et al. f Gene, 33: 
103-119, 1985 


MON105 

(ATCC# 

55204) 


F, lambda-,IN (rrnD, rmE)l, 
rpoD\ rpoH358 


Obukowicz et al., AppL and Envir. 
M£cr.,58: 1511-1523, 1992 


MON208 


W3110 rpoH358, lad Q , ompTiAan 


Alan Easton 


TGI 


delta(lac-pro), supE, rtM, 
/tsdD5/F(*raD36, proA*B\ iocF, 
tecZdeltaM15) 


Amersham Corp., Arlington 
Heights, Illinois 


W3110 


IN (rrnD-rrnE)l, rp/il 


Lab collection 
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Table 3: Plasmids 



Plasmid 



SEQ Selectable 
ID Marker 
NO. 



Description 



Source 



pACYC177 



Kan R 
Amp^ 



pMON15978 



Amp 1 



pCANTAB 5E 



Amp R 



pMON16016 



Amp 1 



Plasmid with multiple cloning 
sites and two selectable 
markers 



Chang, 
A.C.Y. and 
S.N. Cohen, 
J Bacterid. 
134: 1141- 
1156, 1978 

Plasmid construct comprising a This work 

tandem repeat of the modified 

human granulocyte colony 

stimulating factor <hG-CSF 

with a serine for amino acid 17) 

gene joined by a sequence 

(GCCGG, termed a zero order 

linker), subcloned into the 

plasmid pACYC177 

Phage display vector containing Pharmacia 
lac promoter operably linked to Biotech, 
fd gene 3 signal sequence, a Piscataway, 
linker region, an E tag, and an NJ 
fd gene 3 structural £ene all 
cloned into the vector backbone 
of pUC119 containing ColEl 
ori, the beta lactamase 
resistance ^ene, and an M13 

i 

ori. 

Phagemid presentation vector 
pCANTAB 5E derivation 
containing inserted individual 
cphG-CSF-gene such that it 
was expressed as a part of an 



This work 
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pMON 16017 Amp R 



pMON 16029 Amp R 



pMON16030 Amp R 



pMON16018 Amp R 



MPO species, fused in turn -to 
the amino terminus end of the 
phage genelll product. The 
first amino acid of the cphG- 
CSF domain is amino acid 1 of 
the parent, and the last amino 
acid is residue 174 of the 
parent. The zero order linker is 
attached at the carboxyl end of 
amino acid 174. 

Identical to pMON16016 except This work 

the first amino acid of the 

cphG-CSF domain is amino 

acid 3 of the parent, and the 

last amino acid is residue 2 of 

the parent. 

Identical to pMON1601« except This work 

the first amino acid of the 

cphG-CSF domain is amino 

acid 7 of the parent, and the 

last amino acid is residue 6 of 

the parent. 

Identical to pMON16016 except This work 

the first amino acid of the 

cphG-CSF domain is amino 

acid 9 of the parent, and the 

last amino acid is residue 8 of 

the parent. 

Identical to pMON16016 except This work 

the first amino acid of the 

cphG-CSF domain is amino 

acid 11 of the parent, and the 

last amino acid is residue 10 of 

the parent. 
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pMON16019 



Amp* 



pMON16031 



Amp R 



pMON16020 



Amp* 



pMON16032 



Amp 1 



pMON16033 



Amp 1 



pMON16034 



Amp 1 



Identical to pMONr6016 except This work 

the first amino acid of the 

cphG-CSF domain is amino 

acid 13 of the parent, and the 

last amino acid is residue 12 of 

the parent. 

Identical to pMON16016 except This work 

the first amino acid of the 

cphG-CSF domain is amino 

acid 15 of the parent, and the 

last amino acid is residue 14 of 

the parent. 

Identical to pMONl6016 except This work 

the first amino acid of the 

cphG-CSF domain is amino 

acid 19 of the parent, and the 

last amino acid is residue 18 of 

the parent. 

Identical to pMON16016 except This work 

the first amino acid of the 

cphG-CSF domain is amino 

acid 22 of the parent, and the 

last amino acid is residue 21 of 

the parent. 

Identical to pMON16016 except This work 

the first amino acid of the 

cphG-CSF domain is amino 

acid 27 of the parent, and the 

last amino acid is residue 26 of 

the parent. 

Identical to pMON16016 . except This work 
the first amino acid of the 
cphG-CSF domain is amino 
acid 31 of the parent, and the 
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pMON16035 



pMON16036 



pMON16037 



pMON16038 



pMON16039 



last amino acid is residue 30 of 
the parent. 

Amp R Identical to pMONl6016 except This work 

the first amino acid of the 
cphG-CSF domain is amino 
acid 35 of the parent, and the _ 
last amino acid is residue 34 of 
the parent. 

Am p R Identical to pMONl6016 except This work 

the first amino acid of the 
cphG-CSF domain is amino 
acid 37 of the parent, and the 
last amino acid is residue 36 of 
the parent. 

AmpR Identical to pMONl60 16 except This work 

the first amino acid of the 
cphG-CSF domain is amino 
acid 38 of the parent, and the 
last amino acid is residue 37 of 
the parent. 

,R Identical to pMON16016 except This work 

the first amino acid of the 
cphG-CSF domain is amino 
acid 39 of the parent, and the 
last amino acid is residue 38 of 
the parent. 

Amp R Identical to pMONl6016 except This work 

the first amino acid of the 
cphG-CSF domain is amino 
acid 43 of the parent, and the 
last amino acid is residue 42 of 
the parent. 



Amp 



pMON16040 Amp R 



Identical to pMON16016 except This work 
the first amino acid of the 
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P MON16041 



Amp 1 



pMON16022 



Amp R 



pMON16042 



Amp R 



pMON16043 



Amp 1 



pMON16044 



Amp 1 



PCT/US99/20891 

cphG-CSF domain is amino 
acid 45 of the parent, and the 
last amino acid is residue 44 of 
the parent. 

Identical to pMON16016 except This work 
the first amino acid of the _ 
cphG-CSF domain is amino 
acid 47 of the parent, and the 
last amino acid is residue 46 of 
the parent. 

Identical to pMON16016 except This work 

the first amino acid of the 

cphG-CSF domain is amino 

acid 49 of the parent, and the 

last amino acid is residue 48 of 

the parent. 

Identical to pMONl€016 except This work 

the first amino acid of the 

cphG-CSF domain is amino 

acid 51 of the parent, and the 

last amino acid is residue "50 of 

the parent. 

Identical to pMON16016 except This work 

the first amino acid of the 

cphG-CSF domain is amino 

acid 53 of the parent, and the 

last amino acid is residue 52 of 

the parent. 

Identi£al to pMONl6016 except This work 

the first amino acid of the 

cphG-CSF domain is amino 

acid 56 of the parent, and the 

last amino acid is residue 55 of 

the parent. 
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pMON16023 



Amp 1 



pMON16045 



Amp 1 



pMON16024 



Amp 1 



pMON16046 



Amp 1 



pMON16025 



Amp R 



pMON16047 



Amp 1 



Identical to pMONl6016 except This work 

the first amino acid of the 

cphG-CSF domain is amino 

acid 60 of the parent, and the 

last amino acid is residue 59 of 

the parent. - 

Identical to pMON16016 except This work 

the first amino acid of the 

cphG-CSF domain is amino 

acid 64 of the parent, and the 

last amino acid is residue 63 of 

the parent. 

Identical to pMON16016 except This work 

the first amino acid of the 

cphG-CSF domain is amino 

acid 67 of the parent, and the 

last amino acid is residue 66 of 

the parent. 

Identical to pMON16016 except This work 

the first amino acid of the 

cphG-CSF domain is amino 

acid 69 of the parent, and the 

last amino acid is residue 68 of 

the parent. 

Identical to pMONl6016 except This work 

the first amino acid of the 

cphG-CSF domain is amino 

acid 71 of the parent, and the 

last amino acid is residue 70 of 

the parent. 

Identical to pMON16016 except This work 
the first amino acid of the 
cphG-CSF domain is amino 
acid 73 of the parent, and the 
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pMON16048 



pMON16049 



pMON16050 



pMON16051 



pMON16052 



last amino acid is residue 72 of 
the parent. 

Amp R Identical to pMON 160 16 except This work 

the first amino acid of the 
cphG-CSF domain is amino 
acid 84 of the parent, and the 
last amino acid is residue 83 of 
the parent. 

Amp R Identical to pMON16016 except This work 

the first amino acid of the 
cphG-CSF domain is amino 
acid 98 of the parent, and the 
last amino acid is residue 97 of 
the parent. 

Amp R Identical to pMONl€016 except This work 

the first amino acid of the 
cphG-CSF domain is amino 
acid 100 of the parent, and the 
last amino acid is residue 99 of 
the parent. 

Amp R Identical to pMON16016 except This work 

the first amino acid of the 
cphG-CSF domain is amino 
acid 102 of the parent, and the 
last amino acid is residue 101 
of the parent. 

Amp R Identical to pMONlB016 except This work 

the first amino acid of the 
cphG-CSF domain is amino 
acid 112 of the parent, and the 
last amino acid is residue 111 
of the parent. 



pMON16053 Amp R 



Identical to pMON16016 except 
the first amino acid of the 



This work 
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pMON16026 



P MON16027 



pMON16054 



pMON16055 



pMON16056 



cphG-CSF domain is amino 
acid 121 of the parent, and the 
last amino acid is residue 120 
of the parent. 

AmpR Identical to pMON16016 except This work 

the first amino acid of the 
cphG-CSF domain is amino 
acid 123 of the parent, and the 
last amino acid is residue 122 
of the parent. 

AmpR Identical to pMON16016 except This work 

the first amino acid of the 
cphG-CSF domain is amino 
acid 125 of the parent, and the 
last amino acid is residue 124 
of the parent. 

AmpR Identical to pMONl6016 except This work 

the first amino acid of the 
cphG-CSF domain is amino 
acid 133 of the parent, and the 
last amino acid is residue 132 
of the parent. 

Amp R Identical to pMON16016 except This work 

the first amino acid of the 
cphG-CSF domain is amino 
acid 142 of the parent, and the 
last amino acid is residue 141 
of the parent. 

Amp R Identical to pMON16016 except This work 

the first amino acid of the 
cphG-CSF domain is amino 
acid 143 of the parent, and the 
last amino acid is residue 142 
of the parent. 
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pMON16057 



Amp 1 



P MON16028 



Amp 1 



pMON16058 



Amp 



pMON16059 



Amp R 



Identical to pMONl€0H6 except This work 

the first amino acid of *he 

cphG-CSF domain is amino 

acid 147 of the parent, and the 

last amino acid is residue 146 

of the parent. 

Identical to pMON16016 except This work 

the first amino acid of the 

cphG-CSF domain is amino 

acid 159 of the parent, and the 

last amino acid is residue 158 

of the parent. 

Identical to pMON16016 except Tkds work 

the first amino acid of the 

cphG-CSF domain is amino 

acid 168 of the parent, and the 

last amino acid is residue 167 

of the parent. 

Identical to pMON16016 except This work 

the first amino acid of the 

cphG-CSF domain is amino 

acid 170 of the parent, and the 

last amino acid is residue 169 

of the parent. 
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Table 4: Analytical biopanning 



Before receptor* 


After receptor* 


Enrichment 


1/6.6x10' 


1/6.5x10' 


990-fold 



* Amp"/Kan H resistant colonies 

Analytical biopanning shows that MPO molecules containing permuted hG- 
CSF domains can be presented and affinity selected in a hG-CSF receptor 
dependent fashion. A mixture of phagemids presenting MPO: cphG-CSF 38/37 
(ampicillin resistant) and M13k07 (kanamycin resistant) were exposed to BHK 
cells with or without the hG-CSF receptor on their surface, washed and eluted 
from the cell surface. Eluted phage were introduced into E. coli and the 
transfected cells were plated on media containing kanamycin or ampicillin. The 
ratio of ampicillin resistant to kanamycin resistant particles were determined prior 
to and following exposure to receptor by counting resistant colonies. 
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Table 5: Activity of selected permuteins 



Plasmid 



Permutein breakpoint Activity in G-CSF- 

in G-CSF amino acid dependent proliferation 
sequence assay 



pMON16017 

pMON16018 

pMON16019 

P MON16020 

pMON16021 

pMON16022 

pMON16023 

pMON16024 

pMON16025 

pMON16026 

P MON16027 

pMON16028 



3/2 
11/10 
13/12 
19/18 
49/48 
60/59 
67/66 
69/68 
71/70 
123/122 
125/124 
159/168 
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Table 6: SEQ ID Number/SEQ ID Name Correlation 



8SQ 
ZD 
HO. 

1. 

2. 

3. 

4 . 

5. 

6. 

7. 

8. 

9. 

iO. 

11. 

12 

13 . 

14. 

15. 

16. 

17. 

18. 

IS. 

20. 

21. 

22. 

23. 

24. 

25. 

26. 

27. 

28. 

29. 

30. 

31. 

32. 

33. 

34. 

3S. 

36. 

37. 

36. 

39. 

40. 

41. 

42. 

43. 

44. 

45. 

46. 

47. 

.48. 

49. 

SO. 

51. 

52. 

S3. 

54. 

55. 

56. 

57. 

58. 

59. 

t*0. 

61 . 

61. 

63. 

64 . 

tl 

66. 

67. 

68. 

69. 

-o. 



FCS1 

FG52 

FCS3 

FCS4 

FCS5 

FGS6 

FGS7 

FGS8 

FGS9 

FCS10 

FCS11 

FGS12 

PCS 13 

FCS14 

FGS15 

FCS16 

FGS17 

FGS18 

FCS19 

FGS20 

FGS21 

FC522 

FCS23 

FG524 

FGS25 

FCS26 

FCS27 

FGS28 

FGS29 

FGS30 

FGS31 

FCS32 

PGS33 

FCS34 

FGS3 5 

FCS36 

FGS37 

FGS38 

FGS39 

FGS40 

FCS41 

FCS42 

FGS43 

FGS44 

FGS4S 

FGS46 

FGS47 

FCS48 

FCS49 

FCS50 

FCS51 

FOSS2 

FQS53 

FGSS4 

FGS55 

FCS56 

FCS5S 
FCS59 
FCS60 
FGS61 
FCS62 
FGS63 
FGS64 
FGS65 
FGS66 
TW 
FOS68 
FGS69 
FCS70 

rvs?. 



CGGGCGC 
CGCGCGC 
CGCGCGC 
CGCGCGC 
CGCGCGC 
CGCGCGC 
CGCGCGC 
CGCGCGC 
CCOGCGC 



CGCGCGC 
CGCGCGC 
CGCGCGC 
CGCGCGC 
CGCGCGC 
CGCGCGC 
CGCGCGC 
CGCGCGC 
CGCGCGC 
CGCGCGC 
CGCGCGC 
CGCGCGC 



CGCGCGC 

CGCGCGC 

CGCGCGC 

CGCGCGC 

CGCGCGC 

CGCGCGC 

CGCGCGC 

CGCGCGC 

CGCGCGC 

CGCGCGC 

CGCGCGC 

CGCGCGC 

CGCGCGC 

CGCGCGC 

CGCGCGC 

CGCGCGC 

CGCGCGC 

CGCGCGC 

CGCGCGC 

CGCGCGC 

CGCGCGC 

CGCGCGC 



CGCGCGC 
CGCGCGC 
C GCGCG C 
CGCGCGC 
CGCGCGC 
CGCGCGC 
CGCGCGC 



CGCGCGC 
CGCGCGC 



ACATC 

ACATG 

ACATC 

ACATG 

ACATG 

ACATG 

ACATG 

ACATG 

ACATG 

ACATG 

ACATG 

ACATC 

ACATG 

ACATG 

ACATG 

ACATG 

ACATG 

ACATC 

ACATG 

ACATG 

ACATG 

ACATG 

ACATC 

ACATC 

ACATG 

ACATG 

ACATG 

ACATG 

ACATG 

ACATG 

ACATG 

ACATG 

ACATG 

ACATG 

ACATG 

ACATG 

ACATG 

ACATG 

ACATC 

ACATG 

ACATG 

ACATG 

ACATG 

ACATG 

ACATG 

ACATG 

ACATG 

ACATC 

ACATG 

ACATG 

ACATG 

ACATG 

ACATG 

ACATG 

ACATG 

ACATC 

ACATG 

ACATC 

ACATG 



CGCGCGC 



CGCGCGC 
CGCGCGC 
CGCGCGC 
CGCGCGC 
CGCGCGC 



CGCGCGC 



CGCGCGC 
CGCGCGC 
CGCGCGC 



74. 
75. 
76. 
77. 
78. 
75. 



FGS73 
FCS74 
FCS75 
FG576 
FGS77 
FCSTf 
FGS79 



CGCGCGC 
CGCGCGC 
CGCGCGC 
CGCGCGC 
CGCGCGC 
CGCGCGC 



ACATG 
ACATG 
ACATG 
ACATC 
ACATG 
ACATG 
ACATG 
ACATG 
ACATG 
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TATATAT GCGGCCGC GTG 

TATATAT GCGGCCGC GGG 

TATATAT GCGGCCGC CTC 

TATATAT GCGGCCGC CTC 

TATATAT GCGGCCGC CAG 

TATATAT GCGGCCCC CAC 

TATATAT GCGGCCGC CAG 

TATATAT GCGGCCGC GAG 

TATATAT GCGGCCCC TCC 

TATATAT GCGGCCCC CTC 

TATATAT GCGGCCGC AGA 

TATATAT GCGGCCGC CAG 

TATATAT GCGGCCGC GCC 

TATATAT GCGGCCGC GAT 



CGC 


CTT 


CTA 


CGC 


CTT 


CTA 


CCC 


.CAC 


CTA 


CGC 


CAC 


CTT 


ccc 


CAC 


'CTT 


GCC 


CAC 


.CTT 


GCC 


CAG 




GCC 


CAC 


CCC 


GCC 


CAG 


-CCC 


■GA'C 


CAG 


CCC 


GA'C 


ATC 


CCC 


GA'C 


ATC 


CCT 


ccc 


CAC 


CTT 


CCG 


CAC 


.CTT 


CCC 


CAG 


CTT 


CCG 


CAG 


CCC 


GCC 


CAG 


CCC 


A'CT 


CAG 


CCC 


A'CT 


ACT 


CCC 


A'CT 


ACT 


CAT 


CGC 


CAC 


CTT 


CCC 


CAC 


CTT 




CAG 




CCG 


CAG 


CCC 


GCC 


CAG 


CCC 


GGC 




CCC 


GGC 


CGC 


•CCC 


GCC 


CGC 


CGC 


CAT 


CTC 


CCG 


CTG 


CAT 


CTC 


ACC 




CAT 


CTC 


AGA 


•CCC 




CAT 


GTC 




AGC 




CTC 


ACC 


AGC 


CAT 




AGA 


TCT 


AGC 


CAT 


CTC 


TCT 




CAT 


GTC 


TCT 




CAT 


GTC 


TCG 


it __ 


AGC 


CAT 


TCG 


TGT 




CAT 


TCG 


TCT 


AGC 


CAT 


CAA 






AGC 


GCC 




TCC 




AGG 


GCC 


CAA 


TCG 


GCC 


AGC 


CCC 


CAA 


CCT 


CGC 


AGG 


GCC 


GGA 


CCT 


GGC 


AGG 


CAG 


GCA 




GCC 


GGG 


CAG 


GGA 


OCT 


CTC 


CGC 


• CAG 


GGA 


CCT 


CTC 


GGG 


CAG 


GAA 




CTG 


GGC 


CAG 


GAA 


CCT 


CTG 


GAG 




GAA 




CTT 


CAC 


CAG 


GAA 


AGA 




GAG 


CAG 


TAA 


AGA 


CTT 


GAG 


CTC 


TAA 


AGA 


CTT 




CTC 


TAA 


AGA 


CAC 




CTC 


TAA 




CAC 




CTC 




CCT 


CAC 






CTT 


CCT 


CAC 


CTC 


GAT 


CTT 


CCT 


GCC 


-CTG 


GAT 


CTT 


ATC 


CCC 


CTG 


GAT 


GCC 


ATC 






TCC 


• GCC 


ATC 


GCC 


CCC 


TCC 


CCC 


ATC 




CGC 


TCC 


GCC 


CTC 


GAS 


ccc 


TGC 


CTC 


CTG 


GAG 


CGC 


CTT 


CTC 


CTG 


GAG 


CAG 




CTC 


CTG 


ACA 


CAG 


CTT 


CTC 


GCC 


ACA 


CAG 


CTT 




GGC 


ACA 


CAG 


GTA 


CCT 


GGC 


ACA 


CTT 


GTA 


. GGT 


GGC 


CAC 




GTA 


CCT 


GCA 


CAG 


CTT 


GTA 




GCA 


CAG 


CTT 


GGG 




GCA 


CAG 


CTC 


GGG 


GTG 


CCA 


CTC 


CTC 


GGG 


CTG 


CAG 


CTC 


CTC 


GCC 


CAC 


CAG 


CTC 


CTC 


CAG 


CAC 


CAC 


CTC 


GAG 


CAG 


CAC 


CAC 


TCC 


GAG 


CAG 


CAC 


GTG 


TCC 


GAG 


CAC 


AGA 


GTG 


TCC 


CAG 


CAC 


AGA 


CTG 


TCC 


CCC 


CAG 


AGA 


GTC 
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CAC 


CTT 


CCG 


CTT 


GCC 


CAG 


CCG 


CAG 


•CCC 


CAG 


CCC 


GA'C 


CCC 


GA'C 


ATC 


GA'C 


ATC 


CCT 


ATC 


CCT 


ACA 


OCT 


ACA 


CCA 


ACA 


CCA 


TTG 


CAG 


CCC 


ACT 


CCC 


A'CT 


ACT 


A'CT 


ACT 


CAT 


ACT 


CAT 


CCA 


CAT 


CCA 


CCT 


CCA 


CCT 


ATG 


CAC 


CCC 


GGC 


•ccc 


GGC 


GCC 




GGC 


-GGC 


CCC 


GGC 


TCT 


•CGC 


TCT 


GA'C 


TCT 


GA'C 


ATG 


•CCC 


AAG 




ACC 


ATT 




GCC 


GCC 




CTG 


CGC 






ACG 




GCC 


CCC 




GCC' 


CTG 




ACG 


CCT 






CCC 




CTC 


GGG 




CTC 


ACG 




CTC 


AGA 




CAT 


GTC 




AGC 


CAT 




TGT 














TCC 




CM- 


CAA 




AGG 


GCC 




CSC 


AGG 
CGC 




GCA 






CAG 


GGA 






CAG 




.CTG 








CTG 




GAA 






CAG 


GAA 




GAG 


CAG 




•CTT 


GAG 




AGA 


-CTT 




TAA 


AGA 




CTC 


TAA 




TTG 


CTC 




CAC 


TTG 




CCT 


CAC 






CCT 




CAT 






CTC 


GAT 




CCC 


CTG 




ATC 


-ccc 




CCC 


ATC 




TCC 


GCC 




CCC 


TCC 




GAG 


CGC 




CTG 


GAG 




CTC 


CTC 




CTT 


CTC 




CAG 


CTT 




ACA 


CAG 




GCC 


ACA 




GGT 


GGC 




CTA 


GGT 




CTT 


GTA 




CAG 


CTT 




CCA 


CAG 




■ -GTG 


GCA 




GGG 


GTG 




CTC 


GGG 




err 


CTC 




CAG 


CTC 




CAC 


CAS 




CAS 


CAC 




CAG 


CAS 




TZZ 


CAG 
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252. 


RCS57 


TATATAT 


GCGGCCGC 


GGG 


GAT 


GCC 


-CAG 


-ASA 


GTS 


TCC 


2S3. 


RGSS8 


TATATAT 


GCGGCCGC 


CCA 


GGG 


GAT 


GCC 


_ CAG 


ASA 


GTG 


254. 


RGSS9 


TATATAT 


GCGGCCGC 


AGC 


CCA 


GGG 


GAT 


'GCC 


-"CAS 


AGA 


255. 


RGS60 


TATATAT 


GCGGCCGC 


GGG 


AGC 


CCA 


GGG 


■CAT 


GCC 


CAG 


256. 


RCS61 


TATATAT 


GCGGCCGC 


CAG 


GGG 


AGC 


CCA 


GGG 


GAT 


GCC 


25^. 


RGS62 


TATATAT 


GCGGCCGC 


GCT 


CAG 


CGG 


ACC 


CCA 


CGG 


CAT 


258. 


RCS63 


TATATAT 


GCGGCCGC 


GCA 


GCT 


CAG 


GGG 


AGC 


CCA 


CGG 


259. 


RGS64 


TATATAT 


GCGGCCGC 


GCA 


GGA 


GCT 


CAG 


GGG 


AGC 


CCA 


260. 


RGS6S 


TATATAT 


GCGGCCGC 


GGG 


GCA 


GGA 


GCT 


CAG 


GGG 


ACC 


261. 


AGS 6 6 


TATATAT 


GCGGCCGC 


OCT 


GGG 


CCA 


GGA 


CCT 


CAG 


CGG 


262. 


RGS67 


TATATAT 


GCGGCCGC 


CTG 


GCT 


GGG 


GCA 


GGA 


GCT 


CAG 


263. 


RGS68 


TATATAT 


GCGGCCGC 


GGC 


CTG 


GCT 


GGG 


GCA 


GGA 


CCT 


264. 


RGS69 


TATATAT 


GCGGCCGC 


CAG 


GGC 


CTG 


GCT 


GGC 


GCA 


GCA 


265. 


RGS70 


TATATAT 


GCGGCCGC 


CTG 


CAG 


GGC 


CTG 


CCT 


GGG 


CCA 


266. 


RGS71 


TATATAT 


GCGGCCGC 


CAG 


CTG 


CAG 


GCC 


CTG 


GCT 


GGG 


267. 


RGS72 


TATATAT 


GCGGCCGC 


TGC 


CAG 


CTG 


CAG 


GGC 


CTG 


CCT 


268. 


RGS73 


TATATAT 


GCGGCCGC 


GCC 


TGC 


CAG 


CTG 


CAG 


GGC 


■CTG 


269. 


RGS74 


TATATAT 


GCGGCCGC 


GCA 


GCC 


TCC 


CAG 


CTC 


CAG 


GGC 


270. 


RGS75 


TATATAT 


GCGGCCGC 


CAA 


GCA 


GCC 


TCC 


CAG 


CTG 


CAG 


271. 


RGS76 


TATATAT 


GCGGCCGC 


err 


CAA 


GCA 


GCC 


TGC 


CAG 


CTG 


272. 


RGS77 


TATATAT 


GCGGCCGC 


GTT 


GCT 


CAA 


GCA 


CCC 


TGC 


CAG 


273. 


RGS78 


TATATAT 


GCGGCCGC 


GAG 


GTT 


GCT 


CAA 


GCA 


CCC 


TGC 


274. 


RGS79 


TATATAT 


GCGGCCGC 


ATG 


GAG 


GTT 


GCT 


CAA 


CCA 


CCC 


275. 


RGS80 


TATATAT 


GCGGCCGC 


GCT 


ATG 


GAG 


GTT 


CCT 


CAA 


GCA 


276. 


RG581 


TATATAT 


GCGGCCGC 


GCC 


GCT 


ATG 


GAG 


GTT 


GCT 


CAA 


277. 


RGS82 


TATATAT 


GCGGCCGC 


AAG 


GCC 


CCT 


ATG 


GAG 


GTT 


GCT 


278. 


RGS83 


TATATAT 


GCGGCCGC 


GAA 


AAG 


GCC 


GCT 


ATG 


GAS 


•GTT 


279. 


RGS64 


TATATAT 


GCGGCCGC 


GAG 


GAA 


AAG 


CCC 


CCT 


ATS 


GAG 


280. 


RG585 


TATATAT 


GCGGCCGC 


GTA 


GAG 


GAA 


AAG 


GCC 


GCT 


ATG 


281. 


RGS86 


TATATAT 


GCGGCCGC 


CTG 


GTA 


GAG 


GAA 


AAG 


GCC 


GCT 


282. 


RGS67 


TATATAT 


GCGGCCGC 


CCC 


CTG 


GTA 


CAG 


GAA 


AAG 


CCC 


283. 


RGS88 


. TATATAT 


GCGGCCGC 


GAG 


CCC 


CTC 


GTA 


GAG 


GAA 


AAG 


284. 


RG589 


TATATAT 


GCGGCCGC 


CAG 


GAG 


CCC 


CTC 


GTA 


GAG 


CAA 


285. 


RGS90 


TATATAT 


GCGGCCGC 


CTG 


CAG 


GAG 


CCC 


CTG 


GTA 


GAG 


286. 


RGS91 


TATATAT 


GCGGCCGC 


GGC 


CTC 


CAG 


GAG 


CCC 


CTG 


GTA 


287. 


RGS92 


TATATAT 


GCGGCCGC 


CAG 


GGC 


CTG 


CAG 


GAG 


CCC 


CTG 


288, 


RG593 


TATATAT 


GCGGCCGC 


TTC 


CAG 


GCC 


CTC 


CAG 


GAG 


-CCC 


289. 


RGS94 


TATATAT 


GCGGCCGC 


CCC 


TTC 


CAG 


GCC 


CTG 


CAG 


GAC 


290. 


RGS9S 


TATATAT 


GCGGCCGC 


TAT 


CCC 


TTC 


CAG 


GGC 


CTC 


CAG 


291. 


RGS96 


TATATAT 


GCGGCCGC 


GGA 


TAT 


CCC 


TTC 


CAG 


GGC 


CTC 


292. 


RGS97 


TATATAT 


GCGGCCGC 


GGG 


GGA 


TAT 


CCC 


TTC 


CAG 


GGC 


293. 


RGS98 


TATATAT 


GCGGCCGC 


CTC 


GGG 


GGA 


TAT 


CCC 


TTC 


CAG 


294. 


RGS99 


TATATAT 


GCGGCCGC 


CAA 


CTC 


GGG 


GGA 


TAT 


CCC 


TTC 


295. 


RGS100 


TATATAT 


GCGGCCGC 


ACC 


CAA 


CTC 


GGG 


GGA 


TAT 


CCC 


296. 


RGS101 


TATATAT 


GCGGCCGC 


GGG 


ACC 


CAA 


CTC 


GGG 


GGA 


TAT 


297. 


RG5102 


TATATAT 


GCGGCCGC 


GGT 


GGG 


ACC 


CAA 


CTC 


GGG 


GGA 


298. 


RGS103 


TATATAT 


GCGGCCGC 


CAA 


GGT 


GGG 


ACC 


CAA 


CTC 


GGG 


299. 


RGS104 


TATATAT 


GCGGCCGC 


GTC 


CAA 


GGT 


GGG 


ACC 


CAA 


CTC 


300. 


RGS105 


TATATAT 


GCGGCCGC 


TGT 


GTC 


CAA 


GCT 


GGG 


ACC 


CAA 


301. 


RGS106 


TATATAT 


GCGGCCGC 


CAG 


TGT 


CTC 


CAA 


GGT 


GGG 


ACC 


302. 


RGS107 


TATATAT 


GCGGCCGC 


CTG 


CAG 


TGT 


CTC 


CAA 


GGT 


GGG 


303. 


RG5108 


TATATAT 


GCGGCCGC 


CAG 


CTG 


CAG 


TGT 


GTC 


CAA 


GOT 


304. 


RGS109 


TATATAT 


GCGGCCGC 


GTC 


CAG 


CTC 


CAG 


TGT 


■CTC 


CAA 


305. 


RGS110 


TATATAT 


GCGGCCGC 


GAC 


GTC 


CAG 


CTC 


CAG 


TGT 


CTC 


306. 


RGS111 


TATATAT 


GCGGCCGC 


GGC 


GAC 


GTC 


CAG 


"CTC 


CAG 


TGT 


307. 


RGS112 


TATATAT 


GCGGCCGC 


GTC 


GGC 


GAC 


CTC 


CAG 


CTG 


"CAG 


308. 


RCS113 


TATATAT 


GCGGCCGC 


AAA 


GTC 


GGC 


GAC 


GTC 


CAG 


CTG 


309. 


RGS114 


TATATAT 


GCGGCCGC 


GGC 


AAA 


GTC 


GGC 


GAC 


CTC 


CAG 


310. 


RGS115 


TATATAT 


GCGGCCGC 


GGT 


GCC • 


AAA 


GTC 


GGC 


GAC 


CTC 


311. 


RGS116 


TATATAT 


GCGGCCGC 


GGT 


GGT 


GGC 


AAA 


GTC 


GGC 


GAC 


312. 


RGS117 


TATATAT 


GCGGCCGC 


CAT 


GGT 


GGT 


GCC 


AAA 


CTC 


•GGC 


313. 


RG5118 


TATATAT 


GCGGCCGC 


CCA 


GAT 


GGT 


GGT 


GGC 


AAA 


GTC 


314. 


RGS119 


TATATAT 


GCGGCCGC 


CTG 


CCA 


GAT 


GGT 


GGT 


GGC 


AAA 


315. 


RGS120 


TATATAT 


. GCGGCCGC 


CTG 


CTG 


CCA 


CAT 


GGT 


GGT 


GGC 


316. 


RGS121 


TATATAT 


GCGGCCGC 


CAT 


CTG 


CTG 


CCA 


GAT 


GCT 


'GOT 


317. 


RGS122 


TATATAT 


GCGGCCGC 


TTC 


CAT 


CTG 


CTC 


CCA 


GAT 


GGT 


318. 


RGS123 


TATATAT 


GCGGCCGC 


TTC 


TTC 


CAT 


CTC 


CTG 


CCA 


GAT 


319. 


RGS124 


TATATAT 


GCGGCCGC 


CAG 


TTC 


TTC 


CAT 


CTG 


CTG 


CCA 


320. 


RGS125 


TATATAT 


GCGGCCGC 


TCC 


CAG 


TTC 


TTC 


CAT 


CTG 


CTC 


321. 


RGS126 


TATATAT 


GCGGCCGC 


CAT 


TCC 


CAG 


TTC 


TTC 


CAT 


CTG 


322. 


RGS127 


TATATAT 


GCGGCCGC 


GCC 


CAT 


TCC 


CAG 


TTC 


TTC 


-CAT 


323. 


RGS128 


TATATAT 


GCGGCCGC 


AGG 


GGC 


CAT 


TCC 


CAG 


TTC 


TTC 


324. 


RGS129 


TATATAT 


GCGGCCGC 


GGC 


AGG 


GGC 


CAT 


TCC 


CAG 


TTC 


325. 


R3S130 


TATATAT 


GCGGCCGC 


CAG 


GGC 


AGG 


GGC 


CAT 


TCC 


'CAG 


326. 


RGS131 


TATATAT 


GCGGCCGC 


CTG 


CAG 


GGC 


AGG 


GGC 


CAT 


TCC 


327. 


RGS132 


TATATAT 


GCGGCCGC 


GGG 


CTC 


CAG 


GCC 


AGG 


GGC 


CAT 


J NO t 


RSS133 


TATATAT 


GCGGCCGC 


GGT 


GGG 


CTG 


CAG 


CCC 


AGG 


GGC 


329 . 


RGS134 


TATATAT 


GCGGCCGC 


CTG 


GGT 


GGG 


CTC 


CAG 


GGC 


AGG 


330. 


RGS13S 


TATATAT 


GCGGCCGC 


ACC 


CTG 


GGT 


CCC 


CTG 


CAS 


GGC 


331. 


RGS13 6 


TATATAT 


GCGGCCGC 


GGC 


ACC 


CTC 


GGT 


GGG 


•CTG 


CAG 


332. 


RGS13? 


TATATAT 


GCGGCCGC 


CAT 


GCC 


ACC 


CTC 


GGT 


GSS 


CTS 


333. 


*gsi: z 


TATATAT 


GCGGCCGC 


CGG 


CAT 


GGC 


ACC 


CTG 


GGT 


GGG 


334. 


RGZ13> 


TATATAT 


GCGGCCGC 


GGC 


CGG 


CAT 


CCC 


ACC 


CTG 


GGT 


335. 




TATATAT 


GCGGCCGC 


GAA 


-GGC 


CGG 


CAT 


GGC 


ACC 


CTG 




RGS141 


TATATAT 


GCGGCCGC 


GGC 


GAA 


GGC 


CGG 


-CAT 






337. 


RGS142 


TATATAT 


GCGGCCGC 


ASA 


GGC 


GAA 


GGC 


CGG 


CAT 
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338. 

339. 

340. 

341. 

342. 

343. 

344. 

34S. 

346. 

347. 

34B. 

349. 

350. 

351. 

352. 

353. 

354. 

355. 

356. 

357. 

356. 

359. 

360. 

361. 

362. 

363. 

364. 

365. 

366. 

367. 

368. 



RCS143 

RGS144 

RQS145 

RGS146 

RGS147 

RGS148 

RGS149 

RGS1S0 

RGS151 

RGS152 

RG5153 

RGS154 

RGS155 

RG5156 

RCS157 

RGS158 

RGS159 

RGS160 

RGS161 

RGS162 

RGS163 

RGS164 

RG5165 

RGS166 

RGS167 

RGS168 

RGS169 

RGS170 

RG5171 

RGS172 

RGS173 



TATATAT 


GCCGCCGC 


AGC 


AGA 


GGC 


GAA 


GGC 


CCC 


CAT 


TATATAT 


GCCGCCGC 


CAA 


AGC 


AGA 


CGC 


GAA 


CGC 


CGG 


TATATAT 


GCCGCCGC 


CTC 


GAA 


AGC 


AGA 


" GGC 


GAA 


GGC 


TATATAT 


GCCGCCGC 


GCG 


CTC 


GAA 


AGC 


AGA 


GGC 


GAA 


TATATAT 


GCGGCCCC 


CCC 


GCG 


-CTC 


• GAA 


AGC 


AGA 


GGC 


TATATAT 


GCCGCCGC 


TGC 


CCC 


GCG 


CTG 


GAA 


AGC 


AGA 


TATATAT 


GCGGCCCC 


TCC 


TGC 


CCC 


GCG 


CTG 


GAA 


AGC 


TATATAT 


GCCGCCGC 


CCC 


TCC 


TGC 


CCC 


GCG 


CTG 


GAA 


TATATAT 


GCCGCCGC 


GAC 


CCC 


TCC 


TGC 


CCC 


GCG 


CTC 


TATATAT 


GCGGCCCC 


CAG 


GAC 


CCC 


TCC 


TGC 


CCC 


GCG 


TATATAT 


GCGGCCCC 


AAC 


CAG 


GAC 


CCC 


TCC 


TGC 


CCC 


TATATAT 


GCCGCCGC 


AGC 


AAC 


CAG 


GAC 


CCC 


TCC 


TGC 


TATATAT 


GCGGCCCC 


GCT 


AGC 


AAC 


CAG 


CAC 


CCC 


TCC 


TATATAT 


GCGGCCCC 


ATG 


GCT 


AGC 


AAC 


CAC 


GAC 


CCC 


TATATAT 


GCCGCCGC 


CAG 


ATG 


CCT 


AGC 


AAC 


CAC 


GAC 


TATATAT 


GCGGCCCC 


era 


CAC 


ATG 


CCT 


AGC 


AAC 


CAG 


TATATAT 


GCGGCCCC 


CCT 


CTC 


CAG 


ATG 


CCT 


AGC 


AAC 


TATATAT 


GCCGCCGC 


GAA 


CCT 


CTC 


CAG 


ATG 


CCT 


AGC 


TATATAT 


GCCGCCGC 


CAG 


GAA 


CCT 


CTG 


CAG 


ATG 


<SCT 


TATATAT 


GCGGCCCC 


CTC 


CAG 


GAA 


GCT 


. CTC 


CAG 


ATG 


TATATAT 


GCGGCCCC 


CAC 


CTC 


CAG 


GAA 


GCT 


CTG 


CAG 


TATATAT 


GCGGCCCC 


CCA 


CAC 


CTC 


CAC 


GAA 


GCT 


■•CTG 


TATATAT 


GCGGCCCC 


GTA 


CCA 


CAC 


CTC 


CAG 


GAA 


GCT 


TATATAT 


GCGGCCCC 


GCG 


GTA 


CCA 


CAC 


CTC 


CAG 


GAA 


TATATAT 


GCGGCCCC 


AAC 


GCG 


GTA 


CCA 


CAC 


CTC 


CAC 


TATATAT 


GCGGCCCC 


TAG 


AAC 


GCG 


GTA 


CCA 


CAC 


•CTC 


TATATAT 


GCGGCCCC 


GCG 


TAG 


AAC 


GCG 


GTA 


CCA 


CAC 


TATATAT 


GCCGCCGC 


GTG 


GCG 


TAG 


AAC 


GCG 


GTA 


CGA 


TATATAT 


GCGGCCCC 


AAC 


CTC 


GCG 


TAG 


AAC 


GCG 


GTA 


TATATAT 




CGC 


AAC 


GTG 


GCG 


TAG 


AAC 


GCG 


TATATAT 




CTC 


CGC 


AAC 


GTG 


GCG 


TAG 


AAC 



ft 
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CLAIMS 

What is claimed is: 

1. A method for making a biologically-active circularly-permuted protein of 
the formula C*-L l -N l , derived from a parent protein of the formula NM3 1 , 
wherein 

C 1 is comprised of a segment derived from the^arboxy portion of 
said parent protein; 

N 1 is comprised of a segment derived from the amino terminal 
portion of said parent protein; and 

L. 1 is a chemical bond or a linker, linking C 1 to the amino ^terminus 
of L 1 and carboxy terminus of L 1 to the amino terminus of N 1 ; 

comprising the steps of: 

(a) making a series of circularly-permuted -genes; 

(b) inserting said circularly-permuted genes into a display vector; 

(c) expressing said circularly-permuted genes such that the proteins 
encoded by said genes are presented on the surface of the display 
vector; 

(d) generating a library of display vectors presenting the expressed 
circularly permuted protein; 

(e) affinity-selecting the presenting display vectors with a target 
protein that can bind a biologically-active circularly-permuted 
protein; 

(f) isolating and analyzing clones of selected display vectors to identify 
the presented circularly-permuted protein. 

2. The method of claim 1 wherein the method of making a series of 
circularly-permuted genes is selected from the group consisting of 
making a tandemly-repeated intermediate, total synthesis of a 
synthetic gene, assembly of a gene from synthetic oligonucleotides, 
DNA amplification, and limited digestion of a -circular 
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intermediate. 

3. The method of claim 1 wherein said display vector is selected from 
the group consisting of bacteriophage display vectors, bacteria, and 
baculovirus vectors. 

4. The method of claim 3, wherein said presentation vector is 
a bacteriophage. 

5. The method of claim 4, wherein said presentation 
vector is bacteriophage MI3. 



6. 

10 



The method of claim 5, wherein said 
presentation vector is a bacteriophage M13 
gene III vector. 



7. The method of claim 1 wherein said method of making a series of 
circularly permuted genes is a method of making a tandem repeat 
intermediate. 

15 8 The method of claim 7 wherein said circularly-permuted 

genes are amplified from the repeat by gene amplification. 

9. The method of claim 1 wherein said method of affinity-selection 
comprises the steps consisting of: 

(a) binding said presentation display vectors to a target 
20 protein; 

(b) eluting said display vectors; 

(c) amplifying said display vectors; and 

(d) biopanning a pool of said amplified display vectors. 

10. The method of claim 1 wherein L 1 is a linear peptide linker. 

25 li. The method of claim 1 wherein said the DNA sequence encoding 

said linker U is selected from the group consisting ofSEQ ID NO: 
1 through SEQ ID NO: 368. 



12. 



The method of claim 1 wherein the length of the C l in-said 
permutein is shorter than the length of C 1 in said parent protein. 
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13. The method of claim 1 wherein the length of the N 1 in said 
permutein is shorter than the length of N 1 in said parent protein. 

14. A circularly-permuted protein prepared by the method of claim 1. 

15. A circularly-permuted protein of claim 14 comprising the G- 
CSF receptor agonist domain of a species of mylepoietin 
(MPO). 
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IB. Screening a display library 
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SEQLIST.TXT 



SEQUENCE LISTING 

<110> Lee, Stephen C 

<120> Method of Producing Permuteins by 
Scanning Permutagenesis 

<130> C_3041 

<140> 60/ 

<141> 1998-09-25 

<160> 368 

<170> FastSEQ for Windows Version 3.0 

<210> 1 

<211> 36 

<212> DNA 

<213> Synthetic 

<400> 1 

cgcgcgcaca tgtctacacc attgggccct gccagc 

<210> 2 ' 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 2 

cgcgcgcaca tgtctccatt gggccctgcc agctcc 

<210> 3 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 3 

cgcgcgcaca tgtctttggg ccctgccagc tccctg 

<210> 4 
<211> 36 
<212> DNA 
<2I3> Synthetic 

<400> 4 

cgcgcgcaca tgtctggccc tgccagctcc ctgccc 

<2I0> 5 
<211> 36 
<212> DNA 

<223> Synthetic ' 
<400> 5 

cgcgcgcaca tgtctcctgc cagctccctg ccccag 

<210> 6 

<211> 36 

<212> DNA 

<213> Synthetic 



<400> 6 

cgcgcgcaca tgtctgccag ctccctgccc cagagc 



36 
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<210> 7 - 
<211> 36 
<2X2> DNA 
<213> Synthetic 

<400> 7 

cgcgcgcaca tgtctagctc cctgccccag agcttc 36 

<210> 8 

<211> 36 

<212> DNA 

<213> Synthetic 

<400> 8 

cgcgcgcaca tgtcttccct gccccagagc ttcctg 36 

<210> 9 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 9 

cgcgcgcaca tgtctctgcc ccagagcttc ctgctc 36 

<210> 10 
<2U> 36 
<212> DNA 
<213> Synthetic 

<400> 10 

cgcgcgcaca tgtctcccca gagcttcctg ctcaag 36 

<210> 11 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 11 i 
cgcgcgcaca tgtctcagag cttcctgctc aagtct 36 

<210> 12 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 12 

cgcgcgcaca tgtctagctt cctgctcaag tcttta 36 

<210> 13 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 13 

cgcgcgcaca tgtctttcct gctcaagtct ttagag 36 

<2i0> 14 

<211> 36 

<212> DNA 

<213> Synthetic 

<400> 14 

cgcgcgcaca tgtctctgct caagtcttta gagcaa 36 

<21Q> IS 
<211> 36 
<212> DMA 
<213> Synthetic 
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<400> 15 

cgcgcgcaca tgtctctcaa gtctttagag caagtg 

<210> 16 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 16 

cgcgcgcaca tgtctaagtc tttagagcaa gtgagg 

<210> 17 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 17 

cgcgcgcaca tgtcttcttt agagcaagtg aggaag 

<210> 18 

<211> 36 

<212> DNA 

<213> Synthetic 

<400> 18 

cgcgcgcaca tgtctttaga gcaagtgagg aagatc 

<210> 19 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 19 

cgcgcgcaca tgtctgagca agtgaggaag atccag 

<210> 20 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 20 

cgcgcgcaca tgtctcaagt gaggaagatc cagggc 

<210> 21 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 21 

cgcgcgcaca tgtctgtgag gaagatccag ggcgat 

<210> 22 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 22 

cgcgcgcaca tgtctaggaa gatccagggc gatggc 

<210> 23 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 23 

cgcgcgcaca tgtctaagat ccagggcgat ggcgca 
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<210> 24 
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<211> 36 
<212> DNA 
<213> Synthetic 



<400> 24 

cgcgcgcaca tgtctatcca gggcgatggc gcagcg 



36 



<210> 25 

<211> 36 

<212> DNA 

<213> Synthetic 

<400> 25 

cgcgcgcaca tgtctcaggg cgatggcgca gcgctc 36 

<210> 26 

<211> 36 

<212> DNA 

<213> Synthetic 



<210> 27 

<211> 36 

<212> DNA 

<213> Synthetic 

<400> 27 

cgcgcgcaca tgtctgatgg cgcagcgctc caggag 36 

<210> 28 

<211> 36 

<212> DNA 

<213> Synthetic 



<210> 29 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 29 

cgcgcgcaca tgtctgcagc gctccaggag aagctg 36 

<210> 30 

<211> 36 

<212> DNA 

<213> Synthetic 



<400> 26 

cgcgcgcaca tgtctggcga tggcgcagcg ctccag 



36 



<400> 28 

cgcgcgcaca tgtctggcgc agcgctccag gagaag 



36 



<400> 30 

cgcgcgcaca tgtctgcgcc ccaggagaag ctgtgt 



36 



<210> 31 
<211> 36 
<212> DNA 



<213> Synthetic 



<400> 31 

cgcgcgcaca tgtctctcca ggagaagctg tgtgcc 



36 



<210> 32 
<211> 36 
<212> DNA 



<213> Synthetic 



<400> 32 
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cgcgcgcaca tgtctcagga gaagctgtgt gccacc 

<210> 33 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 33 

cgcgcgcaca tgtctgagaa gctgtgtgcc acctac 

<210> 34 

<211> 36 

<212> DNA 

<213> Synthetic 

<400> 34 

cgcgcgcaca tgtctaagct gtgtgccacc tacaag 

<210> 35 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 35 

cgcgcgcaca tgtctctgtg tgccacctac aagctg 

<210> 36 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 36 

cgcgcgcaca tgtcttgtgc cacctacaag ctgtgc 



<210> 37 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 37 

cgcgcgcaca tgtctgccac ctacaagctg tgccac 

<210> 38 
<211> 36 
<212> DNA 
<213> Synthetic 



36 



<400> 38 

cgcgcgcaca tgtctaccta caagctgtgc cacccc 

<21Q> 39 
<211> 36 
<212> DNA 
«c213> Synthetic 

<400> 39 

cgcgcgcaca tgtcttacaa gctgtgccac cccgag , 

<210> 40 

<211> 36 

<212> DNA 

<213> Synthetic 

<400> 40 

cgcgcgcaca tgtctaagct gtgccacccc gaggag 

<210> 41 
<211> 36 
<212> DNA 
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<213> Synthetic 
<400> 41 

cgcgcgcaca tgtctctgtg ccaccccgag gagctg 

<210> 42 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 42 

cgcgcgcaca tgtcttgcca ccccgaggag ctggtg 

<210> 43 

<211> 36 

<212> DMA 

<213> Synthetic 

<400> 43 

cgcgcgcaca tgtctcaccc cgaggagctg gtgctg 

<210> 44 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 44 

cgcgcgcaca tgtctcccga ggagctggtg ctgctc 

<210> 45 
<21X> 36 
<212> DNA 
<213> Synthetic 

<400> 45 

cgcgcgcaca tgtctgagga gctggtgctg ctcgga 

<210> 46 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 46 

cgcgcgcaca tgtctgagct ggtgctgctc ggacac 

<210> 47 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 47 

cgcgcgcaca tgtctctggt gctgctcgga cactct 

<210> 48 

<211> 36 

<212> DNA 

<213> Synthetic 

<400> 48 

cgcgcgcaca tgtctgtgct gctcggacac tctctg 

<210> 49 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 49 

cgcgcgcaca tgtctctgct cggacactct ctgggc 
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<210> 50 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 50 

cgcgcgcaca tgtctctcgg acactctctg ggcatc 36 

<210> 51 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 51 

cgcgcgcaca tgtctggaca ctctctgggc atcccc 36 

<210> 52 

<211> 36 

<212> DNA 

<213> Synthetic 

<400> 52 

cgcgcgcaca tgtctcactc tctgggcatc ccctgg 36 

<210> 53 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 53 

cgcgcgcaca tgtcttctct gggcatcccc tgggct 36 

<210> 54 
<211> 36 
<212> ENA 
<213> Synthetic 

<400> 54 

cgcgcgcaca tgtctctggg catcccctgg gctccc 3^6 

<210> 55 

<211> 36 

<212> DNA 

<213> Synthetic 

<400> 55 

cgcgcgcaca tgtctggcat cccctgggct cccctg 3* 

<210> 56 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 56 

cgcgcgcaca tgtctatccc ctgggctccc ctgagc 36 

<210> 57 , 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 57 

cgcgcgcaca tgtctccctg ggctcccctg agctcc 36 

<210> 58 
<21X> 36 
<212> DNA 
<213> Synthetic 
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<400> 58 

cgcgcgcaca tgtcttgggc tcccctgagc tcctgc 36 

<210> 59 
<211> 36 
<212> DMA 
<213> Synthetic 

<400> 59 

cgcgcgcaca tgtctgctcc cctgagctcc tgcccc 36 

<210> 60 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 60 

cgcgcgcaca tgtctcccct gagctcctgc cccagc 36 

<210> 61 

<211> 36 

<212> DNA 

<213> Synthetic 

<400> 61 

cgcgcgcaca tgtctctgag ctcctgcccc agccag 36 

<210> 62 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 62 

cgcgcgcaca tgtctagctc ctgccccagc caggcc 36 

<210> 63 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 63 

cgcgcgcaca tgtcttcctg ccccagccag gccctg 36 

<210> 64 

<2U> 36 

<212> DNA 

<213> Synthetic 

<400> 64 

cgcgcgcaca tgtcttgccc cagccaggcc ctgcag 36 

<210> 65 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 65 

cgcgcgcaca tgtctcccag ccaggccctg cagctg 36 

<210> 66 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 66 

cgcgcgcaca tgtctagcca ggccctgcag ctggca 36 



<210> 67 
<211> 36 
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<212> DNA 
<213> Synthetic 

<400> 67 

cgcgcgcaca tgtctcaggc cctgcagctg gcaggc 36 

<210> 68 

<211> 36 

<212> DNA 

<213> Synthetic 

<400> 66 

cgcgcgcaca tgtctgccct gcagctggca ggctgc 36 

<210> 69 
<211> 36 
«c212> DNA 
<213> Synthetic 

<400> 69 

cgcgcgcaca tgtctctgca gctggcaggc tgcttg 36 

<210> 70 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 70 

cgcgcgcaca tgtctcagct ggcaggctgc ttgagc 36 

<210> 71 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 71 

cgcgcgcaca tgtctctggc aggctgcttg agccaa 36 

<210> 72 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 72 

cgcgcgcaca tgtctgcagg ctgc ttgagc caactc 36 

<210> 73 
<211> 36 
<212> DNA 
<213> Synthetic 

c4Ctffc> 72 

frgctpcgcaca tgcctggccg cttgagccaa ctccat 36 
<211> 

<212> DNA , 
<213> Synthetic 

<400> 74 

cgcgcgcaca tgtcttgctt gagccaactc catagc 36 

<210> 73 
<211> 36 
<212> DNA 
<213> Synthetic 



<40O> 75 

cgcgcgcaca tgtctttgag ccaactccat a^cggc 
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<210> 76 

<211> 36 

<212> DNA 

<213> Synthetic 

<400> 76 

cgcgcgcaca tgtctagcca actccatagc ggcctt 

<210> 77 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 77 

cgcgcgcaca tgtctcaact ccatagcggc cttttc 

<210> 78 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 78 

cgcgcgcaca tgtctctcca tagcggcctt ttcctc 

<210> 79 

<211> 36 

<212> DNA 

<213> Synthetic 

<400> 79 

cgcgcgcaca tgtctcatag cggccttttc ctctac 

<210> 80 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 80 

cgcgcgcaca tgtctagcgg ccttttcctc taccag 

<210> 81 
<2U> 36 
<212> DNA 
<213> synthetic 

<400> 81 

cgcgcgcaca tgtctggcct tttcctctac cagggg 

<210> 82 
<211> 36 
<212> DNA 
< Z I 1- > Eyntftecic 

<400> 82 

cgcgcxczca tgxctctttt cctctaccag gggctc 

<210> 83 
<211> 36 
<212> DNA 
<213> Synthetic 

<40O> 83 

cgcgcgcaca tgtctttcct ctaccagggg ctcctg 

•c210> 84 
~<211> 36 
<212> DNA 
<213> Synthetic 



36 
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<400> 84 

cgcgcgcaca tgtctctcta ccaggggctc ctgcag 

<210> 85 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 85 

cgcgcgcaca tgtcttacca ggggctcctg caggcc 

<210> 86 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 86 

cgcgcgcaca tgtctcaggg gctcctgcag gccctg 

<210> 87 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 87 

cgcgcgcaca tgtctgggct cctgcaggcc ctggaa 

<210> 88 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 86 

cgcgcgcaca tgtctctcct gcaggccctg gaaggg 

<210> 89 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 89 

cgcgcgcaca tgtctctgca ggccctggaa gggata 

<210> 90 
<211> 36 
<2X2> DNA 
<213> Synthetic 

<400> 90 

cgcgcgcaca tgtctcaggc cctggaaggg atatcc 

<2T&> dZ 
<211> 36 
<212> DNA 
<.2Vir> synthetic 

<400> 91 

cgcgcgcaca tgtctgccct ggaagggata tccccc 

<210> 92 

<211> 36 

<212> DNA 

<213> Synthetic 

<400> 92 

cgcgcgcaca tgtctctgga agggatarcc cccgag 
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<211> 36 
<212> DNA 
<213> Synthetic 

<400> 93 

cgcgcgcaca tgtctgaagg gatatccccc gagttg 36 

<210> 94 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 94 

cgcgcgcaca tgtctgggat atcccccgag ttgggt 36 

<210> 95 

<211> 36 

<212> DNA 

<213> Synthetic 

<400> 95 

cgcgcgcaca tgtctatatc ccccgagttg ggtccc 3* 

<210> 96 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 96 

cgcgcgcaca tgtcttcccc cgagttgggt cccacc 36 

<210> 97 

<211> 36 

<212> DNA 

<213> Synthetic 

<400> 97 

cgcgcgcaca tgtctcccga gttgggtccc accttg 3* 

<210> 98 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 98 

cgcgcgcaca tgtctgagtt gggtcccacc ttggac 36 

<210> 99 
<211> 36 
<212> DNA 
<213> Synthetic 

<4tl0> 99 

cgcgcgcaca tgtctttggg tcccaccttg gacaca 36 
<210> 100 

<211> 36 , 
<212> DNA 
<213> Synthetic 

<400> IOC 

cgcgcgcaca --qzccggc.ee cace ttggac acactg 36 

<210> 101 
<211> 36 
<212> DNA 
<212> Synthecic 



<400> 1C1 
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cgcgcgcaca tgtctcccac cttggacaca ctgcag 36 

<210> 102 

<211> 36 

<212> DNA 

<213> Synthetic 

<400> 102 

cgcgcgcaca tgtctacctt ggacacactg cagctg 

<210> 103 
<211> 36 
<212> DNA 
<213> Synthetic 

<4Q0> 103 

cgcgcgcaca tgtctttgga cacactgcag ctggac 

<210> 104 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 104 

cgcgcgcaca tgtctgacac actgcagctg gacgtc 

<210> 105 
<211> 36 
<212> DMA 
<213> Synthetic 

<400> 105 

cgcgcgcaca tgtctacact gcagctggac gtcgcc 

<210> 106 

<211> 36 

<212> DNA 

<213> Synthetic 

<400> 106 

cgcgcgcaca tgtctctgca gctggacgtc gccgac 

<210> 107 
<211> 36 
<212> DNA 
<213> synthetic 

<400> 107 

cgcgcgcaca tgtctcagct ggacgtcgcc gacttt 

<210> 108 

<211> 36 

<212> DNA 

<213> Synthetic 

<400> 108 

cgcgcgcaca tgtctctgga cgtcgccgac tttgcc 

<210> 109 
<211> 36 
<*I2> DNA 
<213> Synthetic 

<400> 109 

cgcgcgcaca rctctgacgt cgccgacttt gccacc 36 

<21C> HZ 
<211> 36 
<212> DNA 
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<213> Synthetic 
<400> 110 

cgcgcgcaca tgtctgtcgc cgactttgcc accacc 

<210> 111 
<211> 36 
<212> ENA 
<213> Synthetic 

<400> 111 

cgcgcgcaca tgtctgccga ctttgccacc accatc 

<210> 112 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 112 

cgcgcgcaca tgtctgactt tgccaccacc atctgg 

<210> 113 

<211> 36 

<212> DNA 

<213> Synthetic 

<400> 113 

cgcgcgcaca tgtcttttgc caccaccatc tggcag 

<210> 114 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 114 

cgcgcgcaca tgtctgccac caccatctgg cagcag 

<210> 115 
<211> 36 
<212> DMA 
<213> Synthetic 

<400> 115 

cgcgcgcaca tgtctaccac catc tggcag cagatg 

<210> 116 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 116 

cgcgcgcaca tgtctaccat ctggcagcag atggaa 

<210> 117 
<211> 36 
<212> DNA 
<213> Synthetic 

<40Q> 117 

cgcgcgcaca tgtctatctg gcagcagatg gaagaa 

<2iO> 116 
<2xl> 36 
<212> DNA 
<213> Synthetic 

<400> lis 

cgcgcgxaca tgtcttggca gcagatggaa gaactg 
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<210> 119 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 119 

cgcgcgcaca tgtctcagca gatggaagaa ctggga 

<210> 120 

<211> 36 

<212> DNA 

<213> Synthetic 

<400> 120 

cgcgcgcaca tgtctcagat ggaagaactg ggaatg 

<210> 121 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 121 

cgcgcgcaca tgtctatgga agaactggga atggcc 

<210> 122 

<211> 36 

<212> DNA 

<213> Synthetic 

<400> 122 

cgcgcgcaca tgtctgaaga actgggaatg gcccct 

<210> 123 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 123 

cgcgcgcaca tgtctgaact gggaatggcc cctgcc 

<210> 124 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 124 

cgcgcgcaca tgtctctggg aatggcccct gccctg 

<210> 125 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 125 

cgcgcgcaca tgtctggaat ggcccctgcc ctgcag 

<210> 126 

<211> 36 

<212> DNA 

<213> Synthetic 

<400> 126 

cgcgcgcaca tgtctatggc ccctgccctg cagccc 

<210> 127 

<211> 36 

<212> DNA 

<213> Synthetic 
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<400> 127 

cgcgcgcaca tgtctgcccc tgccctgcag cccacc 

<210> 128 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 128 

cgcgcgcaca tgtctcctgc cctgcagccc acccag 

<210> 129 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 129 

cgcgcgcaca tgtctgccct gcagcccacc cagggt 



<210> 130 

<211> 36 

<212> DNA 

<213> Synthetic 

<400> 130 

cgcgcgcaca tgtctctgca gcccacccag ggtgcc 

<210> 131 
<211> 36 
<212> DNA 
<213> Synthetic 



36 



<400> 131 

cgcgcgcaca tgtctcagcc cacccagggt gccatg 

<210> 132 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 132 

cgcgcgcaca tgtctcccac ccagggtgcc atgccg 

<210> 133 
«c211> 36 
<212> DNA 
<213> Synthetic 

<400> 133 

cgcgcgcaca tgtctaccca gggtgccatg ccggcc 

<210> 134 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 134 

cgcgcgcaca tgtctcaggg tgccatgccg gccttc 

<210> 135 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 135 

cgcgcgcaca tgtctggtgc catgccggcc ttcgcc 

<210> 136 
<211> 36 
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<212> DNA 
<2i3> Synthetic 

<400> 136 

cgcgcgcaca tgtctgccat gccggccttc gcctct 

<210> 137 

<211> 36 

<212> DNA 

<213> Synthetic 

<400> 137 

cgcgcgcaca tgtctatgcc ggccttcgcc tctgct 

<210> 138 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 138 

cgcgcgcaca tgtctccggc cttcgcctct gctttc 

<210> 139 
<211> 36 
<212> DNA 
<213> Synthetic 



36 



<400> 139 

cgcgcgcaca tgtctgcctt cgcctctgct ttccag 

<210> 140 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 140 

cgcgcgcaca tgtctttcgc ctctgctttc cagcgc 

<210> 141 
<211> 36 
<212> CNA 
<213> Synthetic 

<400> 141 

cgcgcgcaca tgtctgcctc tgctttccag cgccgg 



<210> 142 
<211> 36 
<212> DNA 
<213> Synthetic 



<400> 142 

cgcgcgcaca tgtcttctgc tttccagcgc cgggca 

<210> 143 

<211> 36 

<212> DNA 

<213> Synthetic 

<400> 143 

cgcgcgcaca tgtctgcttt ccagcgccgg gcagga 

<210> 144 
<211> 36 
<212> DNA 
<213> Synthetic 



<400> 144 

cgcgcgcaca tgtctttcca gcgccgggca ggaggg 
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<210> 145 

<211> 36 

<212> DNA 

<213> Synthetic 

<400> 145 

cgcgcgcaca tgtctcagcg ccgggcagga ggggtc 

<210> 146 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 146 

cgcgcgcaca tgtctcgccg ggcaggaggg gtcctg 

<210> 147 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 147 

cgcgcgcaca tgtctcgggc aggaggggtc ctggtt 

<210> 148 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 148 

cgcgcgcaca tgtctgcagg aggggtcctg gttgct 

<210> 149 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 149 

cgcgcgcaca tgtctggagg ggtcctggtt gctagc 

<210> 150 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 150 

cgcgcgcaca tgtctggggt cctggttgct agccat 

<210> 151 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 151 

cgcgcgcaca tgtctgtcct ggttgctagc catctg 

<210> 152 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 152 

cgcgcgcaca tgtctctggt tgctagccat ctgcag 

<210> 153 
<211> 36 
<212> DNA 
<213> Synthetic 
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<400> 153 

cgcgcgcaca tgtctgttgc tagccatctg cagagc 36 

<210> 154 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 154 

cgcgcgcaca tgtctgctag ccatctgcag agcttc 36 

<210> 155 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 155 

cgcgcgcaca tgtctagcca tctgcagagc ttcctg 36 

<210> 156 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 156 

cgcgcgcaca tgtctcatct gcagagcttc ctggag 3"6 

<210> 157 

<211> 36 

<212> DNA 

<213> Synthetic 

<400> 157 

cgcgcgcaca tgtctctgca gagcttcctg gaggtg 36 

<210> 158 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 15B 

cgcgcgcaca tgtctcagag cttcctggag gtgtcg 36 

<210> 159 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 159 

cgcgcgcaca tgtctagctt cctggaggtg tcgtac 36 

<210> 160 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 160 

cgcgcgcaca tgtctctcct ggaggtgtcg taccgc 36 

<210> 161 

<211> 36 

<212> DNA 

<213> Synthetic 

<400> 161 

cgcgcgcaca tgtctctgga ggtgtcgtac cgcgtt 36 



<210> 162 
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<211> 36 
<212> DNA 
<213> Synthetic 

<400> 162 

cgcgcgcaca tgtctgaggt gtcgtaccgc gttcta 36 

<210> 163 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 163 

cgcgcgcaca tgtctgtgtc gtaccgcgtt ctacgc 36 

<210> 164 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 164 

cgcgcgcaca tgtcttcgta ccgcgttcta cgccac 36 

<210> 165 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 165 

cgcgcgcaca tgtcttaccg cgttctacgc cacctt 36 

<210> 166 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 166 

cgcgcgcaca tgtctcgcgt tctacgccac cttgcg 36 

<210> 167 
<211> 36 
<212> DNA 
<213> Synthetic 

<40O> 167 

cgcgcgcaca tgtctgttct acgccacctt gcgcag 36 

<210> 168 

<211> 36 

<212> DNA 

<213> Synthetic 

<400> 168 

cgcgcgcaca tgtctctacg ccaccttgcg cagccc 36 

<210> 169 

<211> 36 

<212> DNA 

<213> Synthetic 

<400> 169 

cgcgcgcaca tgtctcgcca ccttgcgcag cccgac 36 

<210> 170 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 170 
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cgcgcgcaca tgtctcacct tgcgcagccc gacatg _ 36 

<210> 171 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 171 

cgcgcgcaca tgtctcttgc gcagcccgac atggct 



<210> 172 
<211> 36 
<212> DNA 
<213> Synthetic 



<4O0> 172 

cgcgcgcaca tgtctgcgca gcccgacatg gctaca 

<210> 173 
<211> 36 
<212> DNA 
<213> Synthetic 

<4O0> 173 

cgcgcgcaca tgtctcagcc cgacatggct acacca 

<210> 174 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 174 

cgcgcgcaca tgtctcccga catggctaca ccattg 

<210> 175 

<211> 36 

<212> DNA 

<213> Synthetic 

<4O0> 175 

cgcgcgcaca tgtctcgcca ccttgcgcag cccact 



<210> 176 
<211> 36 
<212> DNA 
<213> Synthetic 



<400> 176 

cgcgcgcaca tgtctcacct tgcgcagccc actagt 

<210> 177 
<2ll> 36 
<212> KIA 
<213> Synthetic 

<400> 177 

cgcgcgcaca tgtctcttgc gcagcccact agtcat , 

<210> 178 
<211> 36 
<212> KCA, 
<213> Synthetic 

<4O0> 178 

cgcgcgcaca tgtctgcgca ycccactagt caicca 



<2i0> 1*?3 
<211> 36 
<212> DNA 
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<2X3> Synthetic 
<400> 179 

cgcgcgcaca tgtctcagcc cactagtcat ccacct 

<210> 180 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 180 

cgcgcgcaca tgtctcccac tagtcatcca cctatg 

<210> 181 

<211> 36 

<212> DNA 

<213> Synthetic 

<400> 181 

cgcgcgcaca tgtctcgcca ccttgcgcag cceggc 

<210> 182 
<211> 36 
<212> dna 
<213> Synthetic 

<400> 182 

cgcgcgcaca tgtctcacct tgcgcagccc ggcggc 

<210> 183 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 183 

cgcgcgcaca tgtctcttgc gcagcccggc ggcggc 

<210> 184 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 184 

cgcgcgcaca tgtctgcgca gcccggcggc ggctct 

<210> 185 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 185 

cgcgcgcaca tgtctcagcc cggcggcggc tctgac 

<210> 186 
<211> 36 

<■} 1 j> Synshecic 

<CCtt> IE 6 

cgcgcgcaca tgtctcccgg cggcggctct gacatg 

<210> 187 
<211> 3-- 
<212> D!vA 
<21S> Synthetic 

<400> 187 

tatatatccg gccgcagcca zgzcqqqczg cgcaae 
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<210> 188 
<211> 36 
<212> DNA 
<2X3> Synthetic 

<400> 188 

tatatatgcg gccgcagcca tgtcacgcgt acgatt 

<210> 189 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 169 

tatatatgcg gccgcagcca tgtcagagcc gccgcc 

<210> 190 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 190 

tatatatgcg gccgctgtag ccatgtcggg ctgcgc 

<210> 191 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 191 

tatatatgcg gccgctgtag ccatgtcacg cgtacg 



<210> 192 
<211> 36 
<212> DNA 
<213> Synthetic 



<400> 192 

tatatatgcg gccgctgtag ccatgtcaga gccgcc 

<210> 193 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 193 

tatatatgcg gccgctggtg tagccatgtc gggctg 

<210> 194 
<211> 36 
<222> DNA 
<213> Synthetic 

<400> 194 

tatatatgcg gccgctggtg tagccatgtc acgcgt 

<210> 195 
<211> 36 
<212> DNA 
<213> Synthetic 



<400> 195 

tatatatgcg gccgctggtg tagccatgtc agagcc 3* 

<210> 196 
<211> 36 
<212> DNA 
<2I3> Synthetic 
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<400> 196 

tatatatgcg gccgccaatg gtgtagccat gtcggg 

<210> 197 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 197 

tatatatgcg gccgccaatg gtgtagccat gtcacg 

<210> 198 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 198 

tatatatgcg gccgccaatg gtgtagccat gtcaga 

<210> 199 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 199 

tatatatgcg gccgcgccca atggtgtagc catgtc 



<210> 200 

<211> 36 

<212> DNA 

<213> Synthetic 

<400> 200 

tatatatgcg gccgcagggc ccaatggtgt agccat 

<210> 201 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 201 

tatatatgcg gccgcggcag ggcccaatgg tgtagc 

<210> 202 
<211> 36 
<212> DNA 
<213> Synthetic 



<400> 202 

tatatatgcg gccgcgctgg cagggcccaa tggtgt 36 

<210> 203 
<211> 36 
<212> DNA 
<ZIS> Synthetic 

<43$> 203 

tatatatgcg gccgcggagc tggcagggcc caatgg 36 

<iiO> 204 
<211> S£ 
<212> DMA 
<213> Synthetic 

<4Q0> 204 

tatatatgcg gccgco£S9S iii-^cacagg -gcccaa 36 

<210> 205 
<211> 36 
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<212> DNA 
<213> Synthetic 

<400> 205 

tatatatgcg gccgcgggca gggagctggc agggcc 

<210> 206 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 206 

tatatatgcg gccgcctggg gcagggagct ggcagg 

<210> 207 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 207 

tatatatgcg gccgcgctct ggggcaggga gctggc 

<210> 208 
<211> 36 
<212> DNA 
<213> Synthetic 



<400> 208 

tatatatgcg gccgcgaagc tctggggcag ggagct 

<210> 209 

<211> 36 

<212> DNA 

<213> Synthetic 



36 



<400> 209 

tatatatgcg gccgccagga agctctgggg caggga 

<210> 210 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 2X0 

tatatatgcg gccgcgagca ggaagctctg gggcag 

<210> 211 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 211 

tatatatgcg gccgccttga gcaggaagct ctgggg 

<210> 212 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 212 

-azataigcg gccgcagact tgagcaggaa gctctg 

<210> 213 
<211> 36 
<212> dma 



<400> 213 

tatatatgcg cccgctaaag acttgagcag gaagct 
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<210> 214 

<211> 36 

<2l'2> DNA 

<213> Synthetic 

<400> 214 

tatatatgcg gccgcctcta aagacttgag caggaa 

<210> 215 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 215 

tatatatgcg gccgcttgct ctaaagactt gagcag 

<210> 216 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 216 

tatatatgcg gccgccactt gctctaaaga cttgag 

<210> 217 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 217 

tatatatgcg gccgccctca cttgctctaa agactt 

<210> 218 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 218 

tatatatgcg gccgccttcc tcacttgctc taaaga 

<210> 219 

<211> 36 

<212> DNA 

<213> Synthetic 

<400> 219 

tatatatgcg gccgcgatct tcctcacttg ctctaa 

<210> 220 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 220 

tatatatgcg gccgcctgga tcttcctcac ttgctc 

<210> 221 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 221 

tatatatgcg gccgcgccct ggatcttcct cacttg 

<i::> 222 

<2ll> 36 
<212> DNA 
<213> Synthetic 
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<400> 222 

tatatatgcg gccgcatcgc cctggatctt ccccac 

•<210> 223 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 223 

tatatatgcg gccgcgccat cgccctggat cttcct 

<210> 224 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 224 

tatatatgcg gccgctgcgc catcgccctg gatctt 

<210> 225 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 225 

tatatatgcg gccgccgctg cgccatcgcc ctggat 



<210> 226 
<21X> 36 
<212> DNA 
<213> Synthetic 



<400> 226 

tatatatgcg gccgcgagcg ctgcgccatc gccctg 

<210> 227 

<211> 36 

<212> DNA 

<213> Synthetic 

<400> 227 

tatatatgcg gccgcctgga gcgctgcgcc atcgcc 

<210> 228 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 228 

tatatatgcg gccgcctcct ggagcgctgc gccatc 

<210> 229 

<211> 36 

<212> DNA 

<213> Synthetic 

<400> 229 

tatatatgcg gccgccttct cctggagcgc tgcgcc 

<210> 230 
<211> 36 
<2X2> DNA 
<213> Synthetic 

<400> 230 

tatatatgcg gccgccagct tctc<:tggag cgctgc 



<210> 231 
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<211> 36 
<212> DNA 
<213> Synthetic 

<400> 231 

tatatatgcg gccgcacaca gcttctcctg gagcgc 36 

<210> 232 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 232 

tatatatgcg gccgcggcac acagcttctc ctggag 36 

<210> 233 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 233 

tatatatgcg gccgcggtgg cacacagctt ctcctg 36 

<210> 234 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 234 

tatatatgcg gccgcgtagg tggcacacag cttctc 36 

<210> 235 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 235 

tatatatgcg gccgccttgt aggtggcaca cagctt 36 

<210> 236 

<211> 36 

<212> DNA 

<213> Synthetic • 

<400> 236 

tatatatgcg gccgccagct tgtaggtggc acacag 36 

<21Q> 237 

<211> 36 

<212> DNA 

<213> Synthetic 

<400> 237 

tatatatgcg gccgcgcaca gcttgtaggt ggcaca 36 

<210> 238 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 238 

tatatatgcg gccgcgtggc acagcttgta ggtggc 36 

<210> 239 
<211> 36 
<212> DNA 
<213> Synthetic 



<400> 239 
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tatatatgcg gccgcggggt ggcacagctt gtaggt 36 

<210> 240 
<2U> 36 
<212> DNA 
<213> Synthetic 

<400> 240 

tatatatgcg gccgcctcgg ggtggcacag cttgta 36 

<210> 241 
<2U> 36 
<212> DNA 
<213> Synthetic 

<400> 241 

tatatatgcg gccgcctcct cggggtggca cagctt 36 

<210> 242 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 242 

tatatatgcg gccgccagct cctcggggtg gcacag 3S 

<210> 243 
<2U> 36 
<212> DNA 
<213> Synthetic 

<400> 243 

tatatatgcg gccgccacca gctcctcggg gtggca 3€ 

<210> 244 

<211> 36 

<212> DNA 

<213> Synthetic 

<400> 244 

tatatatgcg gccgccagca ccagctcctc ggggtg 36 

<210> 245 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 245 

tatatatgcg gccgcgagca gcaccagctc ctcggg 36 

<210> 246 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 246 

tatatatgcg gccgctccga gcagcaccag ctcctc f 36 

<210> 247 

<211> 36 

<212> DNA 

<213> Synthetic 

<400> 247 

tatatatgcg gccgcgtgtc cgagcagcac cagctc 36 

<210> 248 
<211> 36 
<212> DNA 
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<213> Synthetic 
<400> 248 

tatatatgcg gccgcagagc gtccgagcag caecag 3* 

<210> 249 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 249 

tatatatgcg gccgccagag agtgtccgag cagcac 36 

<210> 250 

<211> 36 

<212> DNA 

<213> Synthetic 

<400> 250 

tatatatgcg gccgcgccca gagagtgtcc gagcag 3 6 

<210> 251 

<211> 36 

<212> DNA 

<213> Synthetic 

<400> 251 

tatatatgcg gccgcgatgc ccagagagtg tccgag 36 

<210> 252 
<211> 36 
<212> DMA 
<213> Synthetic 

<400> 252 

tatatatgcg gccgcgggga tgcccagaga gtgtcc 36 

<210> 253 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 253 

tatatatgcg gccgcccagg ggatgcccag agagtg 36 

<210> 254 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 254 

tatatatgcg gccgcagccc aggggatgcc cagaga 36 

<210> 255 
<211> 36 
<212> DNA 

<213> Synthetic . 
<400> 255 

tatatatgcg gccgcgggag cccaggggat gcccag 36 

<210> 256 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 256 

tatatatgcg gccgccaggg gagcccaggg gatgcc 36 
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<210> 257 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 257 

tatatatgcg gccgcgctca ggggagccca ggggat 

<210> 258 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 258 

tatatatgcg gccgcggagc tcaggggagc ccaggg 

<210> 259 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 259 

tatatatgcg gccgcgcagg agctcagggg agccca * 6 

<210> 260 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 260 

tatatatgcg gccgcggggc aggagctcag gggagc 

<210> 261 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 261 

tatatatgcg gccgcgctgg ggcaggagct cagggg 

<210> 262 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 262 

tatatatgcg gccgcctggc tggggcagga gctcag 

<210> 263 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 263 

tatatatgcg gccgcggcct ggctggggca ggagct 

<210> 264 / 
<211> 36 
<212> DNA 
<2X3> Synthetic 

<400> 264 

tatatatgcg gccgccaggg cctggctggg gcagga 36 

<210> 265 

<211> 36 

<212> DNA 

<213> Synthetic 
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<400> 265 

tatatatgcg gccgcctgca gggcctggct ggggca 3 * 

<210> 266 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 266 

tatatatgcg gccgccagct gcagggcctg gctggg 36 

<210> 267 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 267 

tatatatgcg gccgctgcca gctgcagggc ctggct 3 ° 

<210> 268 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 268 

tatatatgcg gccgcgcctg ccagctgcag ggcctg 36 

<210> 269 

<211> 36 

<212> DNA 

<213> Synthetic 

<400> 269 

tatatatgcg gccgcgcagc ctgccagctg cagggc 36 

<210> 270 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 270 

tatatatgcg gccgccaagc agcctgccag ctgcag 

<210> 271 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 211 

tatatatgcg gccgcgctca agcagcctgc cagctg 3 * 

<220> 272 
<2U> 36 
<212> DKrt. 
<213> Synthetic 

<400> 272 

tatatatgcg gccgcgttgc tcaagcagcc tgccag 36 

<210> 273 

<211> 36 

<212> DNA 

<213> Synthetic 

<400> 273 

tatatatgcg gccgcgaggt tgctcaagca gcctgc 36 



<210> 274 
<211> 36 
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<212> DNA 
<213> Synthetic 

<400> 274 

tatatatgcg gccgcatgga ggttgctcaa gcagcc 36 

<210> 275 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 275 

tatatatgcg gccgcgctat ggaggttgct caagca 36 

<210> 276 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 276 

tatatatgcg gccgcgccgc tatggaggtt gctcaa 36 

<210> 277 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 277 

tatatatgcg gccgcaaggc cgctatggag gttgct 36 

<210> 278 
<211> 36 
<2X2> DNA 
<213> Synthetic 

<400> 278 

tatatatgcg gccgcgaaaa ggccgctatg gaggtt 3* 

<210> 279 

<211> 36 

<212> DNA 

<213> Synthetic 

<400> 279 

tatatatgcg gccgcgagga aaaggccgct atggag 3€ 

<210> 2tBC 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 280 

tatatatgcg gccgcgtaga ggaaaaggcc gctatg 36 

<210> 281 
<211> 36 

<212> DNA i 
<213> Synthetic 

<400> 231 

ratacacgeg gccgccvcat agaggaaaag gccgcc 36 

<210> 282 
<211> 36 

«;213> Synthetic 

<400> 282 

tatatatgcg gccgccccct ggtagaggaa aaggcc 36 
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<210> 283 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 283 

tatatatgcg gccgcgagcc cctggtagag gaaaag 

<210> 284 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 284 

tatatatgcg gccgccagga gcccctggta gaggaa 

<210> 285 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 285 

tatatatgcg gccgcctgca ggagcccctg gtagag 

<210> 286 
<211> 36 
<212> DMA 
<213> Synthetic 

<400> 286 

tatatatgcg gccgcggcct gcaggagccc ctggta 

<210> 287 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 287 

tatatatgcg gccgccaggg cctgcaggag cccctg 

<210> 288 
<21X> 36 
<212> DNA 
<213> synthetic 

<400> 288 

catatatgcg gccgcttcca gggcctgcag gagccc 

<21fc> 2&* 
<211> 36 
<2t2> DMA 
<213> Synthetic 

<400> 289 

tatatatgcg gccgcccctt ccagggcctg caggag 

<210> 290 
<rii> 36 
<2V2> DNA 
<2X3> Synthetic 

<400> 290 

tatatatgcg gccgctatcc cttccagggc ccgcag 

<210> 291 
<2il> 36 
<212> DNA 
<213> Synthetic 
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<400> 291 

tatatatgcg gccgcggata tcccttccag ggcctg 

<210> 292 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 292 

tatatatgcg gccgcggggg atatcccttc cagggc 

<210> 293 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 293 

tatatatgcg gccgcctcgg gggatatccc ttccag 

<210> 294 

<211> 36 

<212> DNA 

<213> Synthetic 

<400> 294 

tatatatgcg gccgccaact cgggggatat cccttc 

<210> 295 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 295 

tatatatgcg gccgcaccca actcggggga tatccc 

<210> 296 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 296 

tatatatgcg gccgcgggac ccaactcggg ggatat 

<210> 297 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 297 

tatatatgcg gccgcggtgg gacccaactc ggggga 

<21Q> 13ft 
<21I> 36 

<£13> Synthetic 

<400> 298 

ta-atargcg gccgccaagg tgggacccaa ctcggg 

<210> 299 
<211> 36 
<212> DNA 
<213> Sy-ZZinz.*L z 

<400> 299 

tatatatgcg gccgcgtcca aggtcrngacc caaccc 
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36 



3* 



3* 



<210> 3CC 



t 
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<2ll> 36 
<212> DNA 
<213> Synthetic 

<400> 300 

tatatatgcg gccgctgtgt ccaaggtggg acccaa 

<210> 301 
<2U> 36 
<212> DNA 
<213> Synthetic 

<400> 301 

tatatatgcg gccgccagtg tgtccaaggt gggacc 

<210> 302 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 302 

tatatatgcg gccgcctgca gtgtgtccaa ggtggg 

<210> 303 

<211> 36 

<212> DNA 

<213> Synthetic 

<400> 303 

tatatatgcg gccgccagct gcagtgtgtc caaggt 

<210> 304 
<211> 36 
<212> OKA 
<213> Synthetic 

<400> 304 

tatatatgcg gccgcgtcca gctgcagtgt gtccaa 

<210> 305 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 305 

tatatatgcg gccgcgacgt ccagctgcag tgtgtc 

<210> 306 

<211> 36 

<212> DNA 

<213> Synthetic 

<400> 306 

tatatatgcg gccgcggcga cgtccagctg cagtgt 

<210> 307 

<ixl> DNA 
<213> Synthetic 

«400> 3Q7 

tdtatatgcg gccgcgtcgg cgacgtccag ctgcag 

<210> 308 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 308 
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tatatatgcg gccgcaaagt cggcgacgtc cagctg 

<210> 309 
<211> 36 
<212> DMA 
<213> Synthetic 

<400> 309 

tatatatgcg gccgcggcaa agtcggcgac gtccag 

<210> 310 
<211> 36 
<212> DNA 



<40O> 310 

tatatatgcg gccgcggtgg caaagtcggc gacgtc 

<210> 311 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 311 

tatatatgcg gccgcggtgg tggcaaagtc ggcgac 

<210> 312 
<211> 36 

<212> dna 
<213> Synthetic 

<400> 312 

tatatatgcg gccgcgatgg tggtggcaaa gtcggc 

<210> 313 
<211> 36 
<212> UNA 
<213> Synthetic 

<400> 313 

tatatatgcg gccgcccaga tggtggtggc aaagtc 

<210> 314 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 314 

tatatatgcg gccgcctgcc agatggtggt ggcaaa 

<210> 315 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 315 

t&z&zazgcg gccgcctgct gccagatggt ggtggc 

<210> 316 
<211> 36 
<212> DNA 
•: :• I j > Synthetic 

<400> 316 

tatatatgcg gccgccatct gctgccagat ggtggt 



<213> Synthetic 




<210> 31? 
<2U> 36 
<212> DNK 
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<211> 36 
<212> DNA 
<213> Synthetic 

<400> 231 

tatatatgcg gccgcacaca gcttctcctg gagcgc 

<210> 232 
<211> 36 
<212> DNA 
<213> Synthetic 



<400> 232 

tatatatgcg gccgcggcac acagcttctc ctggag 

<210> 233 

<211> 36 

<212> DNA 

<213> Synthetic 

<400> 233 

tatatatgcg gccgcggtgg cacacagctt ctcctg 

<210> 234 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 234 

tatatatgcg gccgcgtagg tggcacacag cttctc 



<210> 235 
<211> 36 
<212> DNA 
<213> Synthetic 



<400> 235 

tatatatgcg gccgccttgt aggtggcaca cagctt 

<210> 236 

<211> 36 

<212> DNA 

<213> Synthetic • 

<400> 236 

tatatatgcg gccgccagct tgtaggtggc acacag 

<210> 237 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 237 

tatatatgcg gccgcgcaca gcttgtaggt ggcaca 

<210> 238 
<2X1> 36 
<212> DNA 
<213> Synthetic 

<400> 238 

tatatatgcg gccgcgtggc acagcttgta ggtggc 

<210> 239 

<211> 36 

<212> DNA 

<213> Synthetic 

<400> 239 



DMcnnrtrv ^wn 
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tatatatgcg gccgcggggt ggcacagctt gtaggt j0 

<210> 240 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 240 

tatatatgcg gccgcctcgg ggtggcacag cttgta 

<210> 241 
<211> 36 
' <212> DMA 

<213> Synthetic 

<400> 241 

tatatatgcg gccgcctcct cggggtggca cagctt 

<210> 242 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 242 

tatatatgcg gccgccagct cctcggggtg gcacag 

<210> 243 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 243 

tatatatgcg gccgccacca gctcctcggg gtggca 

<210> 244 
<2X1> 36 
<212> DNA 
<213> Synthetic 

<400> 244 

tatatatgcg gccgccagca ccagctcctc ggggtg 

<210> 245 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 245 

tatatatgcg gccgcgagca gcaccagctc ctcggg ->° 

<210> 246 

<211> 36 

<2i2> DNA 

<213> Synthetic 

<400> 246 

tatatatgcg gccgctccga gcagcaccag ctcctc 

<210> 247 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 247 

tatatatgcg gccgcgtgtc cgagcagcac cagctc J 

<210> 248 
<211> .36 
<212> DNA 
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<213> Synthetic 
<400> 248 

tatatatgcg gccgcagagt gtccgagcag caccag 



36 



<210> 249 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 249 

tatatatgcg gccgccagag agtgtccgag cagcac 

<210> 250 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 250 

tatatatgcg gccgcgccca gagagtgtcc gagcag 

<210> 251 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 251 

tatatatgcg gccgcgatgc ccagagagtg tccgag 

<210> 252 
<211> 36 
<212> DNA 
<213> Synthetic 

<4O0> 252 

tatatatgcg gccgcgggga tgcccagaga gtgtcc 

<210> 253 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 253 

tatatatgcg gccgcccagg ggatgcccag agagtg 



36 



36 



36 



36 



36 



<210> 254 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 254 

tatatatgcg gccgcagccc aggggatgcc cagaga 

<210> 255 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 255 

tatatatgcg gccgcgggag cccaggggat gcccag 



36 



36 



<210> 256 
<211> 36 
<212> DNA 
<213> Synthetic 



<4O0> 256 

tatatatgcg gccgccaggg gagcccaggg gatgcc 



36 
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<210> 257 

<211> 36 

<212> DNA 

<213> Synthetic 

<400> 257 

tatatatgcg gccgcgctca ggggagccca ggggat 36 

<210> 258 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 256 

tatatatgcg gccgcggagc tcaggggagc ccaggg 36 

<210> 259 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 259 

tatatatgcg gccgcgcagg agctcagggg agccca 36 

<210> 260 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 260 

tatatatgcg gccgcggggc aggagctcag gggagc 36 

<210> 261 

<211> 36 

<212> DNA 

<213> Synthetic 

<400> 261 

tatatatgcg gccgcgctgg ggcaggagct cagggg 36 

<210> 262 
<211> 36 
<212> DNA, 
<213> Synthetic 

<400> 262 

tatatatgcg gccgcctggc tggggcagga gctcag 36 

<210> 263 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 263 

tatatatgcg gccgcggcct ggctggggca ggagct 3* 

<210> 264 

<211> 36 

<212> DNA 

<213> Synthetic 

<400> 264 

tatatatgcg gccgccaggg cctggctggg gcagga 3* 

<210> 265 

<211> 36 

<212> DNA 

<213> Synthetic 
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<400> 265 

tatatatgcg gccgcctgca gggcctggct ggggca 

<210> 266 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 266 

tatatatgcg gccgccagct gcagggcctg gctggg 3 * 

<210> 267 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 267 

tatatatgcg gccgctgcca gctgcagggc ctggct 

<210> 268 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 268 

tatatatgcg gccgcgcctg ccagctgcag ggcctg 

<210> 269 
<21X> 36 
<212> DNA 
<213> Synthetic 

<400> 269 

tatatatgcg gccgcgcagc ctgccagctg cagggc 

<210> 270 
<211> 36 
<212> dna 
<213> Synthetic 

<400> 270 

tatatatgcg gccgccaagc agcctgccag ctgcag 

<210> 271 
<211> 36 
<2X2> DNA 
<223> Synthetic 

<400> m 

tatatatgcg gccgcgctca agcagcctgc cagctg 

<21Q> 272 
<2U> 36 
<Z12> DKA 
<2i3> Synthetic 

<400> 272 

tatatatgcg gccgcgttgc tcaagcegcc tgccag 3 * 

<210> 273 
<211> 36 
<212> UNA 
<213> Synthetic 

<400> 273 

tatatatgcg gccgcgaggt tgctcaagca gcctgc - t 



<210> 274 
<211> 3€ 
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<212> DNA 
<213> Synthetic 

<400> 274 

tatatatgcg gccgcatgga ggttgctcaa gcagcc 36 

<210> 275 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 275 

tatatatgcg gccgcgctat ggaggttgct caagca 36 

<210> 276 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 276 

tatatatgcg gccgcgccgc tatggaggtt gctcaa 36 

<210> 277 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 277 

tatatatgcg gccgcaaggc cgctatggag gttgct 36 

<210> 278 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 278 

tatatatgcg gccgcgaaaa ggccgctatg gaggtt 36 

<210> 279 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 279 

latatacgcg gccgcgagga aaaggccgct atggag 36 

<210> 2%C 
<211> 36 
<2X2> DNA 
<213> Synthetic 

<400> 280 

tatatatgcg gccgcgtaga ggaaaaggcc gctatg 36 

<210> 281 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 2B1 

iatacacgcg gccgccrgg- agaggaaaag gccgcc 36 

<210> 282 
<211> 36 
<212> DNA 
<213> Synthetic 



<400> 282 

tatatatgcg gccgccccct ggtagag^aa aaggcc 



36 
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<210> 2B3 
<2X1> 36 
<212> DNA 
<213> Synthetic 

<400> 283 

tatatatgcg gccgcgagcc cctggtagag gaaaag 36 

<210> 284 
<211> 36 
<212> DNA 
<213> Synthetic 

<40O> 284 

tatatatgcg gccgccagga gcccctggta gaggaa 36 

<210> 285 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 285 

tatatatgcg gccgcctgca ggagcccctg gtagag 3*6 

<210> 286 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 286 

tatatatgcg gccgcggcct gcaggagccc ctggta 36 

<210> 287 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 287 

tatatatgcg gccgccaggg cctgcaggag cccctg 36 

<210> 288 

<211> 36 

<212> DNA 

<213> Synthetic 

<40O> 288 

tatatatgcg gccgcttcca gggcctgcag gagccc 36 

<21ft> 2fi* 
<211> 36 
<2I2> DNA 
<213> Syntnetic 

<400> 289 

tatatatgcg gccgcccctt ccagggcctg caggag 36 

<210> 290 

«cm> 36 

<2V2> DNA 

<2I3> Synthetic 

<40O> 290 

tatatatgcg gccgctatcc cttccagggc ccgcag 36 

<210> 291 

<2il> 36 

<212> DNA 

<213> Synthetic 
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<400> 291 

tatatatgcg gccgcggata tcccttccag ggcctg 

<210> 292 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 292 

tatatatgcg gccgcggggg atatcccttc cagggc 

<210> 293 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 293 

tatatatgcg gccgcctcgg gggatatccc ttccag 



<2I0> 294 
<211> 36 
<212> DNA 
<213> Synthetic 



<400> 294 

tatatatgcg gccgccaact cgggggatat cccttc 36 

<2X0> 295 
<211> 36 
<212> DNA 
<2X3> Synthetic 

<400> 295 

tatatatgcg gccgcaccca actcggggga tatccc 36 



<210> 296 
<211> 36 
<212> DNA 
<213> Synthetic 



<400> 296 

tatatatgcg gccgcgggac ccaactcggg ggatat 36 

<210> 297 
<2Xl> 36 
<212> DMA 
<213> Synthetic 

<400> 297 

tatatatgcg gccgcggtgg gacccaactc ggggga 36 

<21G> 29ft 
<211> 36 
<212> IBS* 
<£13> Synthetic 

<400> 298 

tetatatgcg gccgccaagg tgggacccaa ctcggg 36 

<210> 299 
<211> 36 
<212> DNA 
< 21.3 9- SiTicnewic 



<400> 299 

tatatatgcg gccgcgtcca aggtgggacc caaccc 36 



<210> 3CC 
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<2U> 36 
<212> DNA 
<2X3> Synthetic 

<400> 300 

tatatatgcg gccgctgtgt ccaaggtggg acccaa 36 

<210> 301 

<211> 36 

<212> DNA 

<213> Synthetic 

<400> 301 

tatatatgcg gccgccagtg tgtccaaggt gggacc 36 

<210> 302 
<211> 36 
<212> DMA. 
<213> Synthetic 

<400> 302 

tatatatgcg gccgcctgca gtgtgtccaa ggtggg 3 * 

<210> 303 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 303 

tatatatgcg gccgccagct gcagtgtgtc caaggt 36 

<210> 304 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 304 

tatatatgcg gccgcgtcca gctgcagtgt gtccaa 36 

<210> 305 
<2U> 36 
<212> DNA 
<213> Synthetic 

<400> 305 

tatatatgcg gccgcgacgt ccagctgcag tgtgtc 36 

<210> 306 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 306 

tatatatgcg gccgcggcga cgtccagctg cagtgt 3 * 
<210> 307 

<a«> 3.6 

<l'±4> DMA 
<213> Synthetic 

c4O0> 307 

ritatatgcg gccgcgtcgg cgacgtccag ctgcag 36 

<210> 308 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 308 
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tatatatgcg gccgcaaagt cggcgacgtc cagctg 36 

<210> 309 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 309 

tatatatgcg gccgcggcaa agtcggcgac gtccag 36 

<210> 310 
<211> 36 
<212> DNA 
<213> Synthetic 

<40O> 310 

tatatatgcg gccgcggtgg caaagtcggc gacgtc 36 

<210> 311 

<211> 36 

<212> DNA 

<213> Synthetic 

<400> 311 

tatatatgcg gccgcggtgg tggcaaagtc ggcgac 36 

<210> 312 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 312 

tatatatgcg gccgcgatgg tggtggcaaa gtcggc 36 

<210> 313 

<211> 36 

<212> DNA 

<213> Synthetic 

<400> 313 

tatatatgcg gccgcccaga tggtggtggc aaagtc 36 

<210> 314 
<211> 36 
<212> DNA 
<212> Synthetic 

<400> 314 

tatatatgcg gccgcctgcc agatggtggt ggcaaa 36 

<210> 315 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 315 

tetacanpcsr gccgcctgct gccagatggt ggtggc 36 

<21C> 31€ 
<211> 36 
<212> DNA 
•::-I3> Synthetic 

<400> 316 

tatatatgcg gccgccatct gctgccagat ggtggt 36 

<210> 31? 
<2U> 36 
<212> DNA 
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<213> Synthetic 
<400> 317 

tatatatgcg gccgcttcca tctgctgcca gatggt 36 

<210> 318 

<211> 36 

<212> DNA 

<213> Synthetic 

<400> 318 

tatatatgcg gccgcttctt ccatctgctg ccagat 36 

<210> 319 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 319 

tatatatgcg gccgccagtt cttccatctg ctgcca 36 

<210> 320 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 320 

tatatatgcg gccgctccca gttcttccat ctgctg 36 

<210> 321 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 321 

tatatatgcg gccgccattc ccagttcttc catctg 36 

<210> 322 
<211> 36 
* <212> DNA 

<213> Synthetic 

<400> 322 

tatatatgcg gccgcggcca ttcccagttc ttccat 36 

<210> 323 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 323 

tatatatgcg gccgcagggg ccattcccag ttcttc 36 

<210> 324 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 324 

tatatatgcg gccgcggcag gggccattcc cagttc 36 

<210> 325 
<211> 36 
<212> DNA 
<213> Syntaecic 

<400> 325 

tatatatgcg gccgccaggg caggggccat tcccag 36 
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<210> 326 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 326 

tatatatgcg gccgcctgca gggcaggggc cattcc 36 

<210> 327 

<211> 36 

<212> DNA 

<213> Synthetic 

<40O> 327 

tatatatgcg gccgcgggct gcagggcagg ggccat 36 

<210> 328 
<211> 36 
<2X2> DNA 
<213> Synthetic 

<400> 328 

tatatatgcg gccgcggtgg gctgcagggc aggggc 36 

<210> 329 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 329 

tatatatgcg gccgcctggg tgggctgcag ggcagg 36 

<210> 330 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 330 

tatatatgcg gccgcaccct gggtgggctg cagggc 36 

<210> 331 
<211> 36 
<212> DNA 
<2X3> Synthetic 

<400> 331 

tatatatgcg gccgcggcac cctgggtggg ctgcag 36 

<210> 332 
<211> 36 
<212> DNA 
<213> Synthetic 

<4O0> 332 

tatatatgcg gccgccatgg caccctgggt gggctg 36 

<210> 333 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 333 

tatatatgcg gccgccggca tggcaccctg ggtggg 36 

<210> 334 
<211> 36 
<212> DNA 
<213> Synthetic 
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<400> 334 

tatatatgcg gccgcggccg gcatggcacc ctgggt 

<210> 335 
<211> 36 
<212> DNA 
<213> synthetic 

<400> 335 

tatatatgcg gccgcgaagg ccggcatggc accctg 3 ° 

<210> 336 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 336 

tatatatgcg gccgcggcga aggccggcat ggcacc 3 * 

<210> 337 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 337 

tatatatgcg gccgcagagg cgaaggccgg catggc 

<210> 338 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 338 

tatatatgcg gccgcagcag aggcgaaggc cggcat 

<210> 339 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 339 

tatatatgcg gccgcgaaag cagaggcgaa ggccgg 

<210> 340 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 340 

tatatatgcg gccgcctgge aagcagaggc gaaggc 36 

<210> 341 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 341 fi 
tatatatgcg gccgcgcgct ggaaagcaga ggcgaa 36 

<210> 342 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 342 

tatatatgcg gccgcccggc gctggaaagc agaggc 36 



<210> 343 
<211> 36 
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<212> DNA 
<213> Synthetic 

<400> 343 

tatatatgcg gccgctgccc ggcgctggaa agcaga 

<210> 344 

<211> 36 

<212> DNA 

<213> Synthetic 

<400> 344 

tatatatgcg gccgctcctg cccggcgctg gaaagc 

<210> 345 

<211> 36 

<212> DNA 

<213> Synthetic 

<400> 345 

tatatatgcg gccgcccctc ctgcccggcg ctggaa 

<210> 346 
<211> 36 
<212> DNA 
<213> Synthetic 

<4O0> 346 

tatatatgcg gccgcgaccc ctcctgcccg gcgctg 

<210> 347 
<211> 36 
<2X2> DNA 
<213> Synthetic 

<400> 347 

tatatatgcg gccgccagga cccctcctgc ccggcg 

<210> 348 
• <211> 36 

<212> DNA 
<213> Synthetic 

<400> 348 

tatatatgcg gccgcaacca ggacccctcc tgcccg 

<210> 349 

<211> 36 

<212> DNA 

<213> Synthetic 

<400> 349 

tatatatgcg gccgcagcaa ccaggacccc tcctgc 

<210> 350 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 350 

tatatatgcg gccgcgctag caaccaggac ccctcc 

<210> 351 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 351 

tatatatgcg gccgcatggc tagcaaccag gacccc 
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<210> 352 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 352 

tatatatgcg gcegccagat ggctagcaac caggac 36 

<210> 353 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 353 

tatatatgcg gccgcctgca gatggctagc aaccag 36 

<210> 354 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 354 

tatatatgcg gccgcgctct gcagatggct agcaac 36 

<210> 355 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 355 

tatatatgcg gccgcgaagc tctgcagatg gctagc 36 

<210> 356 

<211> 36 

<212> DNA 

<213> Synthetic 

<400> 356 

tatatatgcg gccgccagga agctctgcag atggct 36 

<210> 357 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 357 

tatatatgcg gccgcctcca ggaagctctg cagatg 36 

<210> 358 

<211> 36 

<212> DNA 

<213> Synthetic 

<400> 358 

tatatatgcg gccgccacct ccaggaagct ctgcag 36 

<210> 359 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 359 

tatatatgcg gccgccgaca cctccaggaa gctctg 3^ 

<210> 360 
<211> 36 
<212> DNA 
<213> Synthetic 
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<400> 360 

tatatatgcg gccgcgtacg acacctccag gaagct 36 

<210> 361 

<211> 36 

<212> DMA 

<213> Synthetic 

<400> 361 

tatatatgcg gccgcgcggt acgacacctc caggaa 36 

<210> 362 

<211> 36 

<212> DNA 

<213> Synthetic 

<400> 362 

tatatatgcg gccgcaacgc ggtacgacac ctccag 36 

<210> 363 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 363 

tatatatgcg gccgctagaa cgcggtacga cacctc 36 

<210> 364 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 364 

tatatatgcg gccgcgcgta gaacgcggta cgacac 36 

<210> 365 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 365 

tatatatgcg gccgcgtggc gtagaacgcg gtacga 36 

<210> 366 

<211> 36 

<212> DNA 

<213> Synthetic 

<400> 366 

tatatatgcg gccgcaaggt ggcgtagaac gcggta 36 

<210> 367 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 367 

tatatatgcg gccgccgcaa ggtggcgtag aacgcg 36 

<210> 368 
<211> 39 
<212> DNA 
<2X3> Synthetic 



<400> 368 

tatatatgcg gccgcctgcg caaggtggcg tagaacgcg 
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<213> Synthetic 
<400> 317 

tatatatgcg gccgcttcca tctgctgcca gatggt 36 

<210> 318 

<211> 36 

<212> DNA 

<213> Synthetic 

<400> 316 

tatatatgcg gccgcttctt ccatctgctg ccagat 36 

<210> 319 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 319 

tatatatgcg gccgccagtt cttccatctg ctgcca 36 

<210> 320 
<211> 36 
<212> CNA 
<213> Synthetic 

<400> 320 

tatatatgcg gccgctccca gttcttccat ctgctg 36 

<210> 321 
<21l> 36 
<212> DNA 
<213> Synthetic 

<400> 321 

tatatatgcg gccgccattc ccagttcttc catctg 36 

<210> 322 
<211> 36 
<212> DNA 
<212> Synthetic 

<400> 322 

tatatatgcg gccgcggcca ttcccagttc ttccat 36 

<210> 323 
<2U> 36 
<212> DNA 
<213> Synthetic 

<400> 323 

tatatatgcg gccgcagggg ccattcccag ttcttc 36 

<210> 324 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 324 

tatatacgcg gccgcggcag gggccattcc cagttc 36 

<210> 325 
<211> 36 
<2i2> DNA 
<213> Synthetic 

<40Cr> 325 

tatatatgcg gccgccaggg caggggccat tcccag 36 
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<210> 326 
<211> 36 
<2X2> DNA 
<213> Synthetic 

<400> 326 

tatatatgcg gccgcctgca gggcaggggc cattcc 36 

<210> 327 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 327 

tatatatgcg gccgcgggct gcagggcagg ggccat 36 

<210> 328 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 328 

tatatatgcg gccgcggtgg gctgcagggc aggggc 3*6 

<210> 329 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 329 

tatatatgcg gccgcctggg tgggctgcag ggcagg 36 

<210> 330 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 330 

tatatatgcg gccgcaccct gggtgggctg cagggc 36 

<210> 331 

<211> 36 

<212> DNA 

<213> Synthetic 

<400> 331 

tatatatgcg gccgcggcac cctgggtggg ctgcag 36 

<210> 332 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 332 

tatatatgcg gccgccatgg caccctgggt gggctg 36 

<210> 333 , 
<211> 36 
<212> DNA 
<213> Synthetic 

<40O> 333 

tatatatgcg gccgccggca tggcaccctg ggtggg 36 

<210> 334 
<211> 36 
<212> DNA 
<213> Synthetic 
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<400> 334 

tatatatgcg gccgcggccg gcatggcacc ctgggt 

<210> 335 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 335 

tatatatgcg gccgcgaagg ccggcatggc accctg 

<210> 336 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 336 

tatatatgcg gccgcggcga aggccggcat ggcacc 

<210> 337 
<211> 36 
<212> DNA. 
<213> Synthetic 

<400> 337 

tatatatgcg gccgcagagg cgaaggccgg catggc 

<210> 338 
<211> 36 
<212> DMA 
<213> Synthetic 

<400> 338 

tatatatgcg gccgcagcag aggcgaaggc cggcat 

<210> 339 
<21l> 36 
<212> DNA 
<213> Synthetic 

* 

<400> 339 

tatatatgcg gccgcgaaag cagaggcgaa ggccgg 

<210> 340 
<211> 36 
<212> DNA 
<213> Synthetic 



<400> 340 

tatatatgcg gccgcctgga aagcagaggc gaaggc 36 

<210> 341 
<211> 36 
<212> dna 
<213> Synthetic 

<400> 341 t 
tatatatgcg gccgcgcgct ggaaagcaga ggcgaa 36 

<210> 342 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 342 

tatatatgcg gccgcccggc gctggaaagc agaggc 36 



<210> 343 
<211> 36 
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<212> DNA 
<213> Synthetic 

<400> 343 

tatatatgcg gccgctgccc ggcgctggaa agcaga 36 

<210> 344 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 344 

tatatatgcg gccgctcctg cccggcgctg gaaagc 36 

<210> 345 
<2X1> 36 
<212> DNA 
<213> Synthetic 

<400> 345 

tatatatgcg gccgcccctc ctgcccggcg ctggaa 36 

<210> 346 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 346 

tatatatgcg gccgcgaccc ctcctgcccg gcgctg 36 

<210> 347 

<211> 36 

<212> DNA 

<213> Synthetic 

<400> 347 

tatatatgcg gccgccagga cccctcctgc ccggcg 36 

<210> 348 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 348 

tatatatgcg gccgcaacca ggacccctcc tgcccg 36 

<210> 349 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 349 

tatatatgcg gccgcagcaa ccaggacccc tcctgc 36 

<210> 350 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 350 

tatatatgcg gccgcgctag caaccaggac ccctcc 36 

<210> 351 
<211> 36 
<212> DNA 
<213> Synthetic 



<400> 351 

tatatatgcg gccgcacggc tagcaaccag gacccc 
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<210> 352 
<211> 36 
<212> DNA 
. <213> Synthetic 

<400> 352 

tatatatgcg gccgccagat ggctagcaac caggac 36 

<210> 353 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 353 

tatatatgcg gccgcctgca gatggctagc aaccag 36 

<210> 354 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 354 

tatatatgcg gccgcgctct gcagatggct agcaac 36 

<210> 355 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 355 

tatatatgcg gccgcgaagc tctgcagatg gctagc 36 

<210> 356 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 356 

tatatatgcg gccgccagga agctctgcag atggct 36 

<210> 357 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 357 

tatatatgcg gccgcctcca ggaagctctg cagatg 36 

<210> 358 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 358 

tatatatgcg gccgccacct ccaggaagct ctgcag 36 

<2X0> 359 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 359 

tatatatgcg gccgccgaca cctccaggaa gctctg 36 

<210> 360 

<211> 36 

<212> DNA 

<213> Synthetic 
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<400> 360 

tatatatgcg gccgcgtacg acacctccag gaagct 36 

<210> 361 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 361 

tatatatgcg gccgcgcggt acgacacctc caggaa 36 

<210> 362 

<211> 36 

<212> DNA 

<213> Synthetic 

<400> 362 

tatatatgcg gccgcaacgc ggtacgacac ctccag 36 

<210> 363 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 363 

tatatatgcg gccgctagaa cgcggtacga cacctc 36 

<210> 364 
<2X1> 36 
<212> DNA 
<213> Synthetic 

<4O0> 364 

tatatatgcg gccgcgcgta gaacgcggta cgacac 36 

<210> 365 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 365 

tatatatgcg gccgcgtggc gtagaacgcg gtacga 36 

<210> 366 
<211> 36 
<212> DNA 
<213> Synthetic 

<4O0> 366 

tatatatgcg gccgcaaggt ggcgtagaac gcggta 36 

<210> 367 
<211> 36 
<212> DNA 
<213> Synthetic 

i 

<400> 367 

tatatatgcg gccgccgcaa ggtggcgtag aacgcg 36 

<210> 368 
<211> 39 
<212> DNA 
<213> Synthetic 



<400> 368 

tatatatgcg gccgcctgcg caaggtggcg tagaacgcg 
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Method of producing permuteins by scanning 

permutagenesis 

Priority 

The present application claims priority under Title 35, United States Code, 
5 § 119 of United States Provisional Application Serial No. 60/101,908, filed 

September 25, 1998. 

Field of the invention 

A method of producing circularly-permuted proteins (permuteins) by 
scanning permutagenesis comprises making and inserting a series of circularly- 

10 permuted genes into a display vector, expressing these genes such that the -gene 

products are localized to the surface of the display vector, generating a library of 
display vectors presenting the permuted protein, affinity-selecting the display 
vectors with a target protein that can bind the permuted protein, isolating and 
analyzing clones of selected display vectors to identify the circularly-permuted 

15 protein. The invention further discloses methods of expressing and uses of 

permuteins. 

Background of the invention 

Protein permutagenesis 

Circularly permuted proteins are made by reordering the primary sequence 
20 of a parent protein. The amino and carboxy terminal ends of the parent protein 

are joined by a peptide linker and new amino and carboxy terminal ends are 
generated at other positions in the sequence. This technique of -generating 
variants has been applied to a wide variety of proteins (Table 1). 

Circularly permuted proteins, in many cases, are structurally and 
25 functionally similar to their non-permuted parent molecule after they undergo 

refolding. The information necessary to direct the folding of proteins into tertiary 
structures is present in secondary structural domains. Vectorial "folding of proteins 
from their native amino to carboxy ends is not often observed. The ability of 
permuteins to retain structural and functional properties is remarkable, extending 
30 earlier observations on the plasticity of proteins with respect to amino acid 
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substitutions (Olins P.O. et aL, J. BioL Chem. 270: 23754-23760, 1995; Lowman 
and Wells, J. MoL BioL 234: 564-578, 1993) and short amino acid insertions 
(Sondek, J. and D. Shortle, Proteins 7: 387-393, 1990; Shortle, D. and J. Sondek, 
Curr. Opin. Biotechnol. 6: 299-305). 

5 Protein sequence reorganization 

Rearrangements of DNA sequences serve an important role in evolution by 
generating a diversity of new proteins differing in structure and function. Gene 
duplication and exon shuffling, for example, generate diversity and provide 
organisms with a competitive advantage since the basal mutation rate is low 
10 (Doolittle, Protein Science 1: 191-200, 1992). 

Recombinant DNA methods have facilitated studies on the effect of 
sequence transposition on protein folding, structure, and function. The first 
rearrangement of proteins using this approach was described by Ooldenberg and 
Creighton (J. MoL BioL 165:407-413, 1983). A new N-terminus is selected at an 

15 internal site (breakpoint) of the original sequence, the new sequence having the 

same order of amino acids as the original from the breakpoint until it reaches an 
amin o acid that is at or near the original C-terminus. At this point the new 
sequence is joined, either directly or through an additional portion of sequence 
(linker), to an amino acid that is at or near the original N-terminus, and the new 

20 sequence continues with the same sequence as the original until it reaches a point 

that is at or near the amino acid that was N-terminal to the breakpoint site of the 
original sequence, this residue forming the new C-terminus of the chain. Similar 
approaches have also been used in other studies (Cunningham et al., Proc. Natl. 
Acad. Sci. U.SJL 76:3218-3222, 1979; Teather & Erfle, J. BacterioL 172: 3837- 

25 3841, 1990; ScMmming et al., Eur. J. Biochem. 204: 13-19, 1992; Yamiuchi and 

Minamikawa, FEBS Lett. 260:127-130, 1991: MacGregor et al., FEBS Lett. 
378:263-266, 1996). 

These general approaches have been applied to proteins which range in 
size from 58 to 462 amino acids (Goldenberg & Creighton, J. Mol. Biol. 165:407- 

30 413, 1983; Li & Coffino, MoL Cell. Biol. 13:2377-2383, 1993). The proteins 

represent a broad range of structural classes, including proteins that contain 
predominantly alpha helix (interleukin-4; Kreitman et al., Cytokine 7:311-318, 
1995), beta sheet (interleukin-1; Horlick et al., Protein Eng. 5:427-431, 1992), or 
mixtures of the two types of secondary structures (yeast phosphoribosyl 

35 anthranilate isomerase; Luger et al., Science 243:206-210, 1989). 



WO 00/18905 



3 



PCT/US99/20891 



Although broad categories of protein function are represented in these 
sequence reorganization studies, the results of these studies have been highly 
variable. In many cases substantially lower activity, solubility, or thermodynamic 
stability were observed (E. coli dihydrofolate reductase, aspartate 

5 transcarbamoylase, phosphoribosyl anthranilate isomerase, -glyceraldehyde-3- 

phosphate dehydrogenase, ornithine decarboxylase, ompA, yeast phosphoglycerate 
dehydrogenase). In other cases, the sequence rearranged protein appeared to have 
many nearly identical properties as its natural counterpart (basic pancreatic 
trypsin inhibitor, T4 lysozyme, ribonuclease Tl, Bacillus p-glucanase, interleukin- 
10 IP, a-spectrin SH3 domain, pepsinogen, interleukin-4). In exceptional cases, an 

unexpected improvement over some properties of the natural sequence was 
observed, e.g., the solubility and refolding rate for rearranged a-spectrin SH3 
domain sequences, and the receptor affinity and anti-tumor activity of transposed 
interleukin-4-Pseudomonas exotoxin fusion molecule (Kreitman et al., Proc. Natl. 

15 Acad. ScL U.SJL 91:6889-6893, 1994; Kreitman et al., Cancer Res. SS:3357-3363, 

1995). 

The primary motivation for reorganization studies has been to study the 

role of short-range and long-range interactions in protein folding and stability. 

Sequence rearrangements of this type convert a subset of interactions that are 
20 long-range in the original sequence into short-range interactions in the new 

sequence, and vice versa. The fact that many of these sequence rearrangements 
* are able to attain a conformation with at least some activity is persuasive evidence 

that protein folding occurs by multiple folding pathways (Viguera et al., J. Mol. 

Biol. 247:670-681, 1995). In the case of the SH3 domain of alpha-spectrin, 
25 choosing new termini at locations that corresponded to beta hairpin turns resulted 

in proteins with slightly less stability, but which were nevertheless able to fold. 

The positions of the internal breakpoints used in the studies cited above 
are found exclusively on the surface of proteins, and are distributed throughout the 
linear sequence without any obvious bias towards the ends or the middle (the 

30 variation in the relative distance from the original N-terminus to the breakpoint is 

ca. 10 to 80% of the total sequence length). The linkers connecting the original N- 
and C-tennini in these studies have ranged from 0 to 9 residues. In one case (Yang 
& Schachman, Proc. Natl. Acad. ScL UJS^A. 90:11980-11984, 1993), a portion of 
sequence has been deleted from the original C-terminal segment, and the 

35 connection made from the truncated C-terminus to the original N-terminus. 

Flexible hydrophilic residues such as Gly and Ser are frequently used in the 
linkers. Viguera et aL(J. MoL Biol. 247:670-681, 1995) compared joining the 
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original N- and C- termini with 3- or 4-residue linkers; the 3-residue linker was 
less thennodynamically stable. Protasova et al. (Protein Eng. 7:1373-1377, 1994) 
used 3- or 5-residue linkers in connecting the original N-termini of E. -coli 
dihydrofolate reductase; only the 3-residue linker produced protein in good yield. 

5 Protein permutagenesis can be used to optimize the activity of fusion 

proteins or proteins conjugated to other molecules. A fusion between interleukin-4 
(IL-4) and Pseudomonas exotoxin has been permuted resulting in a protein that 
has the first amino acid of the IL-4 domain at position 38 and the new carboxy end 
occurs at amino acid position 37 (Kreitman, R. J. et al., Proc Natl Acad Sci USA 91: 

10 6889-6893, 1994). The permuted fusion has increased affinity for the IL-4 receptor, 

increased cytotoxicity to IL-4 receptor bearing renal carcinoma cells, and increased 
anti-tumor activity in a murine model, compared to the non-permuted parent 
fusion protein (Kreitman, R. J. et al., Proc Natl Acad Sci USA 91: 6889-6893, 1994; 
Kreitman, R. J. et al., Cancer Res. 55:3357-3363, 1995; Puri, R. K. et al., Cellular 

15 Immunol. 171: 80-86, 1996). Increased potency of the permuted molecule is 

believed to result from a reduction in steric interference between the IL-4 domain 
in the parent molecule and its receptor. 

Steric hindrance is likely to be a concern for other chimeric proteins which 
interact with receptors through a relatively large area of their surface. The same 
20 issue also arises with bioconjugates, containing relatively small chemicals 

conjugated to proteins or other molecules in complex polymers (Rose, K et al., 
Molecular Immunology 32: 1031-1037, 1995). 

Phage display methods 

Display methods allow affinity selection of protein variants from a library 
25 of displayed proteins or peptides (Clackson, T. and J A. Wells, Tibtech 12: 173-184, 

1994; Winter, G., Drug Development Res. 33: 71-89, 1994). Many biological 
entities can be used in display methodologies (so-called genetic packages" for 
presentation, including bacterial and eukaryotic cells, various eukaryotic and 
prokaryotic viruses, and spores), but the most commonly used vehicles used for 
30 display are filamentous bacteriophage, as used herein. We envision the possibility 

that a genetic package other than the particular phage used here could be used to 
present libraries of permuteins, and if so, constitute essentially the same 
invention. 
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Foreign proteins are presented on the surface of a phage particle, and the 
gene encoding the foreign protein is encapsulated in the virion. Because they are 
linked by the phage particle, affinity isolation of the presented protein also leads to 
affinity isolation of the corresponding genes. Extremely large libraries of phage 
5 presented proteins are constructed and affinity screened very rapidly. From the 

standpoint of how quickly mutant proteins can be made and screened for activity, 
phage display is the most efficient mutagenesis technique currently available. 

Functional properties of permuteins 

Permuteins can have improved biological properties by acting through 
10 several mechanisms. The permutein acting on the same type of cell as its parent 

molecule, may have increased binding, or other action, by virtue of increased 
avidity. Dimers or higher order multimers of these proteins with themselves or 
other chemical groups, including proteins, can have increased efficacy or potency, 
or both. 

15 Permuteins can also have improved therapeutic properties through a 

variety of mechanisms such as: (1) alterations in the overall on- or off-rates or or 
K„ of the ligand(s) on the target cell; (2) activation or blockade of complementary 
receptor signaling pathways; and/or (3) more specific targeting of to the cell of 
interest. The permuteins may also possess a unique pharmacokinetic distribution 

20 and clearance profile (Dehmer et al., Circulation, 91, 2188-2194, 1995; Tanaka et 

al.,. Nature Medicine, 3, 437-442, 1997). 

Permuteins can also have improved properties in vivo, compared to the two 
components individually, as a result of alterations in biodistribution or half-life. 
The improved properties can also result from the binding of the permutein to one 
25 or more of the receptors, pharmacokinetics, or uptake of the permutein is altered in 

a favorable manner. 

Molecular biology approaches have traditionally been used to permute 
proteins (Horlick, RA. et al., Protein Engineering 427-431, 1992) although 
chemical approaches have been used to make small permuted proteins 
30 (Goldenberg, D. P. and T. E. Creighton, J. MoL BioL 165: 407-413, 1983). These 

approaches are relatively labor intensive, limiting the number of permuteins that 
can be generated and efficiently screened for the desired biological activities. 
Rapid methods of generating permuteins, coupled with efficient methods for 
screening are needed that will result in the identification of novel active molecules. 
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Summary of the invention 

The present invention is an improved method for generating permuteins 
(scanning permutagenesis) based on the display of proteins on bacteriophage 
surface proteins. Phage display is a powerful, yet convenient tool, traditionally 
5 used for mutagenesis and screening (Clackson, T. and J A. Wells, Tibtech 12: 173- 

184, 1994). Improvements to this technology allow the rapid generation and 
screening of libraries of permuteins. Variables, such as position of the new termini 
and the length and composition of peptide linkers can easily be varied to generate 
libraries of the desired diversity. 

0 The present invention relates to methods of producing biologically-active 

circularly permuted proteins of the formula C'-L'-N 1 , derived from a parent protein 
of the formula N'-C 1 , wherein C is comprised of a segment derived from the 
carboxy portion of said parent protein; N 1 is comprised of a segment derived from 
the amino terminal portion of said parent protein; and L 1 is a che mic al bond or a 

15 linker, linking C 1 to the amino terminus of L 1 and carboxy terminus of L 1 to the 

amino terminus of N"; comprising the steps of: (a) making a series of circularly- 
permuted genes; (b) inserting said circularly-permuted genes into a display vector; 
(c) expressing said circularly-permuted genes such that the proteins encoded by 
said genes are presented on the surface of the display vector, (d) generating a 

20 library of display vectors presenting the expressed circularly permuted protein; Xe) 

* affinity-select the presenting display vectors with a target protein that can bind a 
biologically-active circularly-permuted protein; (f) isolate and analyze clones of 
selected display vectors to identify the presented circularly-permuted protein. 

Preferably the method of making a series of circularly-permuted -genes is 
25 selected from the group consisting of making a tandemly-repeated intermediate, 

total synthesis of a synthetic gene, assembly of a gene from synthetic 
oligonucleotides, DNA amplification, and limited digestion of a circular 
intermediate. 

Preferably, the display vector is selected from the group consisting of 
30 bacteriophage display vectors, bacteria, and baculovirus vectors. Even more 

preferably the presentation vector is a bacteriophage. Even more preferably, the 
presentation vector is bacteriophage M13. Most preferably, the presentation vector 
is a bacteriophage Ml 3 gene III vector. 
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Preferably the method of making a series of circularly permuted genes is a 
method of making a tandem repeat intermediate. Even more preferably circularly 
permuted genes are amplified from the repeat by gene amplification. 

Preferably the method of affinity selection comprises the steps consisting of 
5 (a) binding said presentation display vectors to a target protein; (b) eluting said 

display vectors; (c) amplifying said display vectors; and (d) biopanning a pool of 
said amplified display vectors. 

Preferably, the length of C 1 in the permutein is longer than the length of C 1 
in said parent protein. More preferably, the length of C 1 in the permutein is 
10 shorter than the length of C 1 in said parent protein. Most preferably, the length of 

C 1 in the permutein is the same length as the length of C 1 in said parent protein. 

Preferably, the length of N 1 in the permutein is longer than the length of 
N 1 in said parent protein. More preferably, the length of N 1 in the permutein is 
shorter than the length of N 1 in said parent protein. Most preferably, the length of 
15 N 1 in the permutein is the same length as the length of N' in said parent protein. 

The invention also contemplates circularly permuted proteins of the 
formula C'-L'-N 1 made by the method of scanning permutagenesis. Preferably, the 
DNA sequence encoding said linker V is selected from the group consisting oFSEQ 
ID NO: 1 through SEQ ID NO: 368. 

20 Preferably, the circularly-permuted protein is the G-CSF receptor agonist 

domain of a species of mylepoietin (MPO). MPO is one member of a family of novel 
dual cytokine receptor agonists (McKearn, J.P., Myelorestorative activities of 
synthokine and myelopoietin. In Proceedings of the 1996 IBC Conference on 
Therapeutic Applications of Cytokines, pp. 4.3.1-4.3.18, 1996) which are amenable 

25 to manipulation by phage display (Merlin, S. et al., Applied Biochemistry and 

Biotechnology 67: 15-29, 1997; Lee, S.C., Phage presentation Tor cytokine 
engineering. IBC's Second International Conference on Display Technologies, 
1997). 

Brief description of the figures 
30 Figure 1. Schematic depiction of scanning permutagenesis 

Plate A of Figure 1 shows the strategy to generate a sc annin g 
permutagenesis phage display library. A plasmid containing directly-repeated 
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tandem copies of the hG-CSF -gene, for example, is constructed by standard 
methods. The tandem repeat plasmid is used as the template for PCR 
amplification of genes encoding permuted proteins. Each copy of the G-CSF gene 
is indicated in light gray (turquoise), and a DNA segment encoding a peptide linker 
5 is indicated in dark gray (red). 

In individual PCR reactions, oligonucleotide primers that initiate PCR 
polymerization at the first nucleotide of a chosen codon of G-CSF, and directing 
polymerization to the end of the tandem construct specifying the carboxy end of the 
protein encoded on the template is annealed to the tandem template. Also, a 

10 second specific primer is also annealed to the template that initiates 

polymerization at the last nucleotide of the codon encoding the amino add 
immediately preceding the codon where polymerization begins with "first primer, 
and which directs polymerization in the opposite direction from that first primer. 
Amplification between these two primers produces a DNA segment encoding a 

15 permuted protein. For example, amplification between the primer indicated by a 

black arrow initiating at codon 2 and the primer indicated by the blue arrow and 
initiating at the codon before 2 (codon 1) produces an amplified gene encoding a 
permuted protein whose amino terminal residue is amino acid 2 of the native 
protein, and whose final amino acid is amino acid 1 of the native protein. 

20 A linker peptide is present between the first and final amino acids of the 

parent protein (residues 1 and 174 in this example). A total of 174 individual 
amplifications would produce a complete collection of all permuted proteins of this 
example. More limited collections containing only a selected set of permuteins can 
be made, as well as more extensive collections made from multiple tandem 

25 template plasmids, each containing a different linker sequence between the first 

and last residues of the two directly repeated tandem gene sequences. The 
collection of amplified segments can then be inserted into a phagemid presentation 
vector by standard methods. Phagemid particles produced from these presentation 
constructs are the scanning permutagenesis phage display library. 

30 Plate B of Figure 1 shows the affinity screening of a phage display library 

(See Clackson, T. and JA. Wells, Tibtech 12: 173-184, 1994; Winter, G., Drug 
Development Res. 33: 71-89, 1994). In this example, a hG-CSF scanning 
permutagenesis library as described in Figure 1A is screened using the hG-CSF 
receptor expressed on mammalian cells as the afBnity reagent. In Figure IB, 

35 individual presented proteins are indicated by the shaded circles or diamonds and 

the affinity reagent is indicated by the light gray (pink) rectangles. Presentation 
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library particles are exposed to affinity reagent, unbound particles are washed 
away, and receptor-bound particles are eluted. The eluted particles are amplified 
in E. coli, and the screening cycle is repeated. During any round of the screening 
cycle, the genes encoded (in the present example encoding permuted proteins) by 
5 the selected particles can be expressed and evaluated. 

Figure 2. Permuteins presented in the scanning permutagenesis library 

Human G-CSF (serl7) protein is depicted as a string of circles, each circle 
corresponding to a single amino acid residue. Amino and carboxy ends of the 
protein are indicated. The amino acids of helical regions are indicated by medium 

.10 gray balls, while the amino acids of inter-helical loops are indicated in light gray 

balls (See Hill et aL, Proc. NatL Acad. Sci. USA 90: 5167-5171, 1993). Amino-ends 
of the permuteins made for presentation in the library are indicated in dark gray. 
Asterisks indicate the breakpoints of the presented permuteins which were 
isolated by affinity screening with cells expressing hG-CSF receptor as illustrated 

15 in IB. 

Figure 3. Bioactivity of permuteins identified by affinity screening of the 
sc annin g permutagenesis library 

Individual permuteins were expressed transiently in ma mmalian <?ells. 
Permeation molecules in the culture supernatants were quantitated by EI2SA, and 
20 the proliferative activity of clones was determined using BAF-3-cells dependent on 

G-CSF for growth. The horizontal axis indicate concentration of protein and the 
vertical axis indicate incorporation of tritiated thymidine. 

Definitions 

The following is a list of abbreviations and the corresponding meanings as 
25 used interchangeably herein: 

g = gram(s) 

mg = milligram(s) 

ml and mL = milliliters) 

RT = room temperature 
30 ug and pg = microgram(s) 

uL and pi = microliter! s) 

The following is a list of definitions of various terms used herein: 
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The term "permutein" means a circularly-permuted protein: a protein in 
which the amino and carboxy ends of the parent protein are joined together by a 
peptide linker sequence of zero or more amino acids. The amino and carboxy ends 
of the permuted protein occur at amino acids within the parental sequence. 

The terms "chemical ligation" and "conjugation" mean a chemical reaction 
which covalently links two similar or dissimilar functional groups together 
intramolecularly or intermolecularly. 

The term "peptide linker' means a compound which forms a carboxamide 
bond between two groups having one or more peptide linkages (CONH-) and serves 
as a connector for the propose of amelioration of the distance or space orientation 
between two molecules. 

The term "native sequence" refers to an amino acid or nucleic acid sequence 
which is identical to a wild-type or native form of a gene or protein. 

The terms "mutant amino acid sequence," "mutant protein", "variant 
protein", "mutein", or "mutant polypeptide" refer to a polypeptide having an amino 
acid sequence which varies from a native sequence due to amino acid additions, 
deletions, substitutions, or all three, or is encoded by a nucleotide sequence from 
an intentionally-made variant derived from a native sequence. 

Detailed description of the invention 
Determination of the amino and carboxyl termini of permuteins 

The present invention encompasses circularly permuted-proteins of the 
formula C'-L'-N 1 prepared by phage display techniques. The polypeptide can be 
joined either directly or through a linker segment. The term "directly* defines 
permuteins in which the polypeptide ends are joined without a linker. Thus L 1 
represents a chemical bond or a linker, preferably a polypeptide segment to which 
both C 1 and N 1 are joined, wherein C 1 is comprised of a segment derived from the 
carboxy portion of the parent protein and N l is comprised of a segment derived 
from the amino terminal portion of a parent protein represented by the general 
formula N'-C 1 . Preferably, N J and C 1 in the permuted protein C'-L'-N 1 are the 
same length as in the parent protein N'-C, but each may be independently shorter 
or longer depending on the desired structural characteristics of the permutein. 
Most commonly L 1 is a linear peptide in which C 1 and N 1 are joined by amide 
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bonds, linking C 1 to the amino terminus of L 1 and carboxy terminus of L 1 to the 
amino terminus of N\ 

Additional peptide sequences may also be added to facilitate purification or 
identification of permuteins (e.g., poly-His). A highly antigenic peptide may also be 
5 added that would enable rapid assay and facile purification of the permuteins by a 

specific monoclonal antibody. 

Determination of the linker 

The linking group (L 1 ) is generally a polypeptide of between 1 and *500 
amino acids in length. The linkers joining the two molecules are preferably 
designed to (1) allow the two molecules to fold and act independently of each other, 
(2) not have a propensity for developing an ordered secondary structure which 
could interfere with the functional domains of the two proteins, (3) have minimal 
hydrophobic characteristics which could interact with the functional protein 
domains and (4) provide steric separation of C 1 and N 1 such that C 1 and N* could 
interact simultaneously with their corresponding receptors on a single cell. 
Typically surface amino acids in flexible protein regions include Gly, Asn and Ser. 
Virtually any permutation of amino acid sequences containing Gly, Asn and "Ser 
would be expected to satisfy the above criteria for a linker sequence. Other neutral 
amino acids, such as Thr and Ala, may also be used in the linker sequence. 
Additional amino acids may also be included in the linkers due to the addition of 
unique restriction sites in the linker sequence to facilitate construction of the 
multi-functional proteins. 

Preferred L 1 linkers of the present invention include sequences selected 
from the group of formulas: 

25 (SEQ ID NO: 1) through SEQ ID NOi 268) 

Other linkers are also contemplated by the invention. The present 
invention is, however, not limited by the form, size or number of linker sequences 
employed. The only requirement of the linker is that it does not functionally 
interfere with the folding and function of the individual molecules of the multi- 
30 functional protein. 

Utility of permuteins 

Permuteins of the present invention may exhibit useful properties such as 
having similar or greater biological activity when compared to a single factor or by 
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having improved half-life or decreased adverse side effects, or a combination of 
these properties. 

Permuteins which have little or no activity maybe useful as antigens tor 
the production of antibodies for use in immunology or immunotherapy, as probes or 
as intermediates used to construct other useful permuteins. 

The permuteins of the present invention may have an improved 
therapeutic profile as compared to their parent molecules. For example, some 
permuteins of the present invention may have a similar or more potent activities 
relative to other compounds or proteins without having a similar or corresponding 
increase in side-effects. This is particularly true of multifunctional or fusion 
protein therapeutics, where permutation may relieve steric and other hindrances 
that impair the activity of the parent fusion molecules (see Kreitman, R. J. et aL, 
Proc Natl Acad Sci USA 91: 6889-6893, 1994; Kreitman, R. J. et aL, Cancer Res. 
55:3357-3363, 1995, for examples). 

A general utility of permuteins is in the area of nanoscale devices described 
alternatively as "nanobiologicaT or "nanobiotechnological." These are nanoscale 
devices con tainin g both precise structure nanomaterials and biological functional 
components (such as proteins). Nanodevices have been the subject of several 
reviews (Lee, S.C., Trends in Biotechnology, 16: 239-240, 1998). 

Nanobiological/nanobiotechnological devices generally contain proteins 
covalently coupled to polymers or other non-biological precise structure materials. 
Issues of steric and other interferences with protein activity are applicable to 
proteins in nanobiological/nanobiotechnological devices and are highly analogous 
to the issues with multifunctional/fusion proteins discussed above. Protein 
permutation is folly expected to offer a viable approach to deal with these 
considerations, just as it does in the case of fusion proteins (Kreitman, R. J. et aL, 
Proc Natl Acad Sci USA 91: 6889-6893, 1994; Kreitman, R. J. et al.,., Cancer Res. 
55:3357-3363, 1995). 

Examples 

The following examples will illustrate the invention in greater detail, 
although it will be understood that the invention is not limited to these specific 
examples. Various other examples will be apparent to the person skilled in the art 
after reading the present disclosure without departing from the spirit and scope of 
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the invention. It is intended that all such other examples be included within the 
scope of the appended claims. 

General Materials and Methods 

General methods of cloning, expressing, and characterizing proteins are 
5 found in T. Maniatis et al., Molecular Cloning, A Laboratory Manual, Cold Spring 

Harbor Laboratory, 1982, and references cited therein, incorporated herein by 
reference; and in J. Sambrook et al., Molecular Cloning, A Laboratory Manual, 2 nd 
edition, Cold Spring Harbor Laboratory, 1989, and references cited therein, 
incorporated herein by reference. 

10 Unless noted otherwise, all specialty chemicals were obtained from Sigma 

Chemical Company (St. Louis, MO). Restriction endonucleases and T4 DNA ligase 
were obtained from New England Biolabs (Beverly, MA) or Boehringer Mannheim 
(Indianapolis, IN). 

Strains, plasmids, and bacteriophage 

15 The bacterial strains used in these studies are listed in Table 1. Plasmids 

and bacteriophage used or constructed in this study are listed in Tables 2 and 3, 
respectively. 

Phage and phagemid stocks were made and manipulated as described 
(Kay, B.K., Winter, J., and McCafferty, J., Phage Display of Peptides and Proteins, 
20 Academic Press, San Diego, California, 1996; Merlin, S. et al., Applied 

Biochemistry and Biotechnology 67: 15-29, 1997). 

Transformation of EL colt strains 

E. coli strains (Table 1), such as DHScc™ (Life Technologies, Gaithersburg, 
MD) and TGI (Amersham Corp., Arlington Heights, ID are used for 
25 transformation of ligation reactions and are the hosts used to prepare plasmid 

DNA for transfecting mammalian cells. E, coli strains, such as JM101 (Tanisch- 
Perron et al., Gene, 33: 103-119, 1985) and MON105 tObukowicz et al., AppL and 
Envir. After., 58: 1511-1523, 1992) can be used for expressing the multi-functional 
proteins of the present invention in the cytoplasm or periplasmic space. 

30 DH5a™ Subcloning efficiency cells are purchased as competent cells and 

are ready for transformation using the manufacturer s protocol, while both E. coli 
strains TGI and MON105 are rendered competent to take up DNA using a CaCl 2 
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method. Typically, 20 to 50 mL of cells are grown in LB medium (1% Bacto- 
tryptone, 0.5% Bacto-yeast extract, 150 mM NaCl) to a density of approximately 
1.0 optical density unit at 600 nanometers (OD600) as measured by a Baush & 
Lomb Spectronic spectrophotometer (Rochester, NY). The ceUs are collected by 
centrifugation and resuspended in one-fifth culture volume of CaCL, solution tSO 
mM CaCL, 10 mM Tris-Cl, pH7.4) and are held at 4°C for 30 minutes. The cells are 
again collected by centrifugation and resuspended in one-tenth culture volume of 
CaClj solution. Ligated DNA is added to 0.2 mL of these cells, and the samples are 
held at 4°C for 30-60 minutes. The samples are shifted to 42°C for two minutes 
and 1.0 mL of LB is added prior to shaking the samples at 37"C for one hour. Cells 
from these samples are spread on plates (LB medium plus 1:5% Bacto-agar) 
containing either ampicillin (100 micrograms/mL, ug/mL) when selecting for 
ampicillin-resistant transformants, or spectinomycin (75 ug/mL) when selecting for 
spectinomycin-resistant transformants. The plates are incubated overnight at 
37°C. 

Colonies are picked and inoculated into LB plus appropriate antibiotic < 100 
ug/mL ampicillin or 75 ug/mL spectinomycin) and are grown at 37°C while 
shaking. 

DNA isolation and characterization 

DNA constructs were made and propagated in E. coli using standard 
molecular biology techniques (Sambrook, J. et al., Molecular Cloning, A Laboratory 
Manual, 2°" edition, Cold Spring Harbor Laboratory, 1989). 

Plasmid DNA can be isolated by a number of different methods and using 
commercially available kits known to those skilled in the art. Plasmid DNA is 
isolated using the Promega Wizard™ Miniprep kit (Madison, WD, the Qiagen 
QIAwelt Plasmid isolation kits (Chatsworth, CA) or Qiagen Plasmid Midi or Mini 
kit- These kits follow the same general procedure for plasmid DNA isolation. 
Briefly, cells are pelleted by centrifugation (5000 x g), the plasmid DNA released 
with sequential NaOH/acid treatment, and cellular debris is removed by 
centrifugation (10000 x g). The supernatant (containing the plasmid DNA) is 
loaded onto a column containing a DNA-binding resin, the column is washed, and 
plasmid DNA eluted. After screening for the colonies with the plasmid of interest, 
the E. coli cells are inoculated into 50-100 ml of LB plus appropriate antibiotic'for 
overnight growth at 37°C in an air incubator while shaking. The purified plasmid 
DNA is used for DNA sequencing, further restriction enzyme digestion, additional 
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subcloning of DNA fragments and transfection into E. coli, mammalian cells, or 
other cell types. 

Sequence confirmation 

DNA sequence analysis was performed using the Genesis 2000 DNA 
5 analysis system using standard methods (Prober et al., Science 238: 336-341, 

1987). 

Purified plasmid DNA is resuspended in dHjO and its concentration is 
determined by measuring the absorbance at 260/280 nm in a Bausch and Lomb 
Spectronic 601 UV spectrometer. DNA samples are sequenced using ABI PRISM™ 

10 DyeDeoxy™ terminator sequencing chemistry (Applied Biosystems Division of 

Perkin Elmer Corporation, Lincoln City, CA) kits (Part Number 401388 or 402078) 
according to the manufacturer's suggested protocol usually modified by the 
addition of 5% DMSO to the sequencing mixture. Sequencing reactions are 
performed in a DNA thermal cycler (Perkin Elmer Corporation, Norwalk, CT) 

15 following the recommended amplification conditions. Samples are purified to 

remove excess dye terminators with Centri-Sep™ spin columns (Princeton 
Separations, Adelphia, NJ) and lyophilized. Fluorescent dye labeled sequencing 
reactions are resuspended in deionized formamide, and sequenced on denaturing 
4.75% polyacrylamide-8M urea gels using ABI Model 373A and Model 377 

20 automated DNA sequencers. Overlapping DNA sequence fragments are analyzed 

and assembled into master DNA contigs using Sequencher DNA analysis software 
(Gene Codes Corporation, Ann Arbor, MD. 

Expression of permuted proteins in mammalian cells 

To obtain sufficient protein for analysis of the bioactivity of individual 
25 MPO molecules containing cphG-CSFs, DNA segments containing individual 

affinity-selected MPO: cphGCSFs were subcloned into a mammalian expression 
vector, and expressed transiently in BHK cells as described below. 

The BHK-21 cell line can be obtained from the ATCC (Rockville, MD). The 
cells are cultured in Dulbecco's modified Eagle media (DMEM/high-glucose), 
30 supplemented to 2 iaM (mM) L-glutamine and 10% fetal bovine serum (FBS). This 

formulation is designated BHK growth media. Selective media is BHK -growth 
media supplemented with 453 units/mL hygromycin B (CalBiochem, San Diego, 
CA;. The BHK-21 cell line was previously stably transfected with the HSV 
transactivating protein VP16, which transactivates the IE110 promoter found on 
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the plasmid pMON3359 and pMON3633 and the IE 175 promoter found in the 
plasmid pMON3360B (Hippenmeyer, P.J. and Pegg, L.E., Curr. Opin. BiotechnoL 
6: 548-552, 1995). The VP16 protein drives expression of genes inserted behind the 
IE110 or IE175 promoter. BHK-21 cells expressing the transactivating protein 
5 VP16 are designated BHK-VP16. The plasmid pMON1118 expresses the 

hygromycin resistance gene from the SV40 promoter (Highkin et al., Poultry Sci., 
70: 970-981, 1991). A similar plasmid, pSV2-hph, is available from ATCC. 

BHK-VP16 cells are seeded into a 60 millimeter (mm) tissue culture dish at 
3 x 10 s cells per dish 24 hours prior to transfection. Cells are transfected for 16 

10 hours in 3 mL of "OPTIMEM"™ (Gibco-BRL, Gaithersburg, MD) containing 10 ug 

of plasmid DNA containing the gene of interest, 3 ug hygromycin resistance 
plasmid, pMON1118, and 80 ug of Gibco-BRL UPOFECTAMINE"™ per dish. 
The media is subsequently aspirated and replaced with 3 mL of growth media. At 
48 hours post-transfection, media from each dish is collected and assayed for 

15 activity (transient conditioned media). The cells are removed from the dish by 

trypsin-EDTA, diluted 1:10, and transferred to 100 mm tissue culture dishes 
containing 10 mL of selective media. After approximately 7 days in selective 
media, resistant cells grow into colonies several millimeters in diameter. The 
colonies are removed from the dish with filter paper (cut to approximately the 

20 same size as the colonies and soaked in trypsin/EDTA) and transferred to 

individual wells of a 24 well plate containing 1 mL of selective media. After the 
clones are grown to confluence, the conditioned media is re-assayed, and positive 
clones are expanded into growth media. 

Affinity selection and screening of phagemids 

25 Affinity reagent used for the identification of functional MPO molecules 

con tainin g cphG-CSF (MPO: cphG-CSF) species from the library were BHK cells 
expressing the hG-CSF receptor on their surface. The library pool was subjected to 
iterative affinity selection (four rounds) against BHK cells expressing the h-GCSF 
receptor using previously described techniques (Merlin, S. et al., Applied 

3ft Biochemistry and Biotechnology 67: 15-29, 1997). Between rounds of selection, 

phage eluted from the affinity reagent were amplified in E. coli (Kay, B.KL J. 
Winter, and J. McCofferty, Phage Display of Peptides and Proteins, Academic 
Press, San Diego, California, 1996). 
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Expression of proteins in E. coli 

When large-scale quantities of recombinant protein are desirable for 
structure-function studies, DNA segments containing individual affinity-selected 
MPOicphGCSFs are subcloned into any of a variety of bacterial plasmid expression 
5 vectors, and expressed as a cytoplasmic product or as a secreted protein in E. colu 

E. coli strain MON105 or JM101 harboring the plasmid of interest are 
grown at SVC in M9 plus c as amino acids medium with shaking in an air incubator 
Model G25 from New Brunswick Scientific (Edison, NJ). Growth is monitored at 
OD^ until it reaches a value of 1.0 at which time nalidixic acid (10 mg/mL) in 0.1 

10 N NaOH is added to a final concentration of 50 pg/mL, for cultures -containing 

plasmids with the E. coli recA promoter driving expression of the recombinant 
gene. IPTG is used in place of nalidixic acid, as a chemical inducer to facilitate 
expression from plasmids containing the lac promoter or hybrid lac promoters. The 
cultures are then shaken at 37°C for three to four additional hours. A high degree 

15 of aeration is maintained throughout the culture period in order to achieve 

maximal production of the desired gene product. The cells are examined under a 
light microscope for the presence of inclusion bodies (IB). One mL aliquots of the 
culture are removed for analysis of protein content by boiling the pelleted cells, 
treating them with reducing buffer and electrophoresis via SDS-PAGE (see 

20 Maniatis et al., "Molecular Cloning: A Laboratory Manual", 1982). The culture is 

centrifuged (5000 x g) to pellet the cells. 

Isolation of Inclusion Bodies 

The cell pellet from a 330 mL E. coli culture is resuspended in 15 mL of 
sonication buffer (10 mM 2-amino-2-(hydroxymethyl) 1,3-propanediol 

25 hydrochloride (Tris-HCl), pH 8.0 + 1 mM ethylenediaminetetraacetic acid (EDTA). 

These resuspended cells are sonicated using the microtip probe of a Sonicator Cell 
iyisruptor (Model W-375, Heat Systems-Ultrasonics, Inc., Farmingdale, New York). 
Three rounds of sonication in sonication buffer followed by centrifugation are 
employed to disrupt the cells and wash the inclusion bodies (IB). The first round of 

30 sonication is a 3 minute burst followed by a 1 minute burst, and the final two 

rounds of sonication are for 1 minute each. 

Purification 

The folded proteins can be affinity-purified using affinity reagents such as 
monoclonal antibodies or receptor subunits attached to a suitable matrix. 
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Purification can also be accomplished using any of a variety of chromatographic 
methods such as: ion exchange, gel filtration or hydrophobic chromatography or 
reversed phase HPLC. These and other protein purification methods are described 
in detail (Methods in Enzymology, Volume 182 "Guide to Protein Purification- 
edited by Murray Deutscher, Academic Press, San Diego, California, 1990). 

Protein Characterization 

The purified protein is analyzed by RP-HPLC, electrospray mass 
spectrometry, and SDS-PAGE. The protein quantitation is done by amino acid 
composition, RP-HPLC, and Bradford protein determination. In some cases tryptic 
peptide mapping is performed in conjunction with electrospray mass spectrometry 
to confirm the identity of the protein. 

Baf-3/G-CSF receptor assay 

Briefly, the mouse lymphoid cell line Baf3 was transfected with human 
granulocyte colony stimulating factor receptor (hG-CSFR) cDNA. Stable clones of 
Baf3 which expressed the G-CSFR and proliferated in the presence of hG-CSF 
were isolated and used to investigate the activity of human G-CSF receptor 
agonists without the influence of other human cytokine receptor responses. 

The cDNA encoding hG-CSFR (a gift from Dr. Daniel C. Link (Washington 
* University, St. Louis, MO) was released from the plasmid pEMCVSralpha as a 
Hindlll/EcoRI (5* to 3') fragment, gel-purified, and inserted into the mammalian 
cell expression plasmid pcDNA3 (Invitrogen, San Diego, CA). This plasmid 
contains enhancer-promoter sequences from the immediate early -gene of the 
human cytomegalovirus (CMV), a bovine growth hormone polyadenylation signal 
and transcription termination sequences, a neomycin resistance gene is presentfor 
the selection of G418 stable cell clones, and an ampicillin resistance -gene for 
selection in E. coli. Ligation mixtures were transformed into E. coli strain TGI 
(delta (lac-pro), supE, thi t /i*£D5/F(*raD36, proPCK, lacl\ ZacZdeltaM*51 and 
plasmid DNA was purified using a Qiagen Midiprep Plasmid Kit. The structure of 
plasmid DNAs containing hG-CSFR were confirmed by restriction enzyme analysis 
and by automated DNA sequence analysis using an ABI sequencing machine. One 
of several plasmids with the correct structure was selected and given the 
designation pMON30298. 



Baf3 cells, maintained in complete growth medium (RPMI 1640 
supplemented to 10% FBS and 10% Wehi 3B supernatant as a source for mouse IL- 
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3), were seeded at a subconfluent cell density of 10-5 cells/ml in growth media 
(RPMI 1640 5% FBS; 2 mM L-glutamine) the day prior to the electroporation. The 
cells were collected and rinsed twice in 10 ml serum-free RPMI 1640. The cells 
were diluted to 10*6/ml in serum-free RPMI and 1 ml was placed into each 
electroporation chamber (Gibco/BRL #1608AJ). 130 ug of plasmid DNA was added 
to each chamber and the chamber* were incubated on ice for 30 minutes prior to 
electroporation. The cells were electroporated on ice at a capacitance of 800 uF, 
400V, fast charge, and low ohms in a BRL CellPorator. The cells were immediately 
removed from the chambers and placed into 10 cm dishes containing 10 ml of 
growth medium. The cells were allowed to recover for 48 hr in growth media prior 
to selection. 

After the recovery period, the cells were pelleted at 1000 rpm for 10 
minutes, and resuspended into 10 ml of selection medium (growth medium 
containing 800 ug/ml G418 sulfate (Gibco/BRL). The ceUs were kept in selection 
media, being passaged twice weekly, until only a few viable cells could be seen in 
the mock transfected control cell dishes (approximately 2 weeks). After an 
additional 2 weeks in selection media, the cells which had been electroporated with 
the hG-CSFR cDNA had grown to a cell density which allowed them to be tested 
for proliferation in the presence of hG-CSF (Fukunaga, R. et al., EMBO J. 10 (10): 
2855-2865, 1991). 

The cell proliferation assay conditions are as follows: Briefly, 25,000 cells 
were plated in a microtiter 96 weU plate with or without cytokine in IMDM 
medium supplemented with BSA (50 ug/ml), human transferrin <100 ug/ml). lipid 
(50 ug/ml) 2-mercaptoethanol (50 uM final concentration). Each well was 
incubated with 0.5 uCi of 'H-thymidine (16 hours) and the incorporated 
radioactivity was measured. Triplicate wells containing Baf3 cells were set up 
with 4 nM hG-CSF, 4 nM mIL-3 or media only control. Samples of different 
permuted proteins were tested in each assay. 

Example 1: Construction of a permutein library without a linker region 

Figure 1 shows a schematic of scanning permutagenesis. A plasmid 
construct comprising a tandem repeat of the modified human granulocyte colony 
stimulating factor (hG-CSF with a serine for amino acid 17) gene joined by a 
sequence (GCCGG, termed a zero order linker) was generated and subcloned into 
the plasmid pACYC177 (Chang, A.C.Y. and S.N. Cohen, J Bacterial. 134: 1141- 
1156, 1978) using standard molecular biology methods (Sambrook, J. et al., 
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Molecular Cloning, A Laboratory Manual, 2 nd edition, Cold Spring Harbor Press, 
New York, 1989). The resultant plasmid construct (pMON 15978) was linearized by 
restriction digestion (Smal) and used as a template for PCR amplification of 
circularly permuted hG-CSF (cphG-CSF) genes, following the method of Horlich 
> (Horlick, BJL et al., Protein Engineering 5: 427-431, 1992. For purposes of this 

demonstration of the scanning permutagenesis technique, we chose to make a 
limited permutein library rather than one containing every possible cphG-CSF. 
Figure 2 shows the position of the new amino termini for each new cphG-CSF. 

Individual cphG-CSF genes were inserted into phagemid presentation 
0 vector pCANTAB 5E (Pharmacia Biotech,) such that they were expressed as a part 

of a MPO species (Feng, Y., N. R. Staten, C. M. Baum, N. L. Summers, M. Caparon, 
S. C. Bauer, L. Zurfluh, J. P. McKearn, B. K. Klein, S. C. Lee, C. A. McWherter. 
1997. Multi-functional hematopoeitic receptor agonists. World Patent Application 
WO 97/12985) which was in turn fused to the amino end of the phage genelll 
5 product. The presented fusion protein contained, starting from its amino terminus, 

a hIL-3 receptor agonist, cphG-CSF, and the phage gene III product. The juncture 
between the presented protein and the gene III product was as previously 
described (Merlin, S. et al., Applied Biochemistry and Biotechnology 67: 15-29, 
1997). 

>0 After confirmation of the structure of each phagemid construct, phagemid 

particles were produced for each individual cphG-CSF-presenting species (Merlin 
et al., 1997). Some of these lots of particles were used to individually define the 
affinity properties of specific presented cphG-CSF species in analytical biopanning 
experiments (Caparon, M. H. et al., Molecular Diversity 1: 241-246, 1996; Merlin, 

25 S. et al., Applied Biochemistry and Biotechnology 67: 15-29, 1997), but all of the 

phage particle lots were titered and equivalent numbers of transducing units of 
each particle preparation were pooled together to form the sc annin g 
permutagenesis library for hG-CSF in an MPO background. Figure 2 shows the 
MPO: cphG-CSF species present in the library. 

30 MPO : cphG-CSF 38/37 is an example of the nomenclature used to specify 

the identity of individual permuted proteins. It describes a MPO molecule 
con tainin g a circularly permuted human G-CSF module (with the serine 17 
substitution). The first amino acid of the cphG-CSF domain is amino acid 38 of the 
parent protein, and the last amino acid is residue 37 of the parent. 

35 Example 2: Presentation and Affinity screening of the MPO: cphGCSF 
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library 

MPO: cphG-CSF 38/37, is a full hG-CSF receptor agonist {McKearn, J.P., 
Myelorestorative activities of synthokine and myelopoietin. In Proceedings of the 
1996 IBC Conference on Therapeutic Applications of Cytokines, pp. 4.3.1-4.3.18, 
1996). It was presented on filamentous phage as a positive control to demonstrate 
that permuted proteins can be presented on the surface of phage particles and 
affinity selected. After phagemid particles were produced from this construct, they 
were subjected to analytical biopanning using cells expressing the hG-CSF receptor 
as affinity reagent. 

Table 1 shows that phage presented MPO: cphG-CSF 38/37 was affinity 
selected by cells expressing the hG-CSF. MPO: cphGCSF 38/37-GPHI fusion was 
expressed, secreted and assembled into phagemid particles, and could be affinity 
selected by the hG-CSF receptor. Permutagenesis of a protein does not appear to 
impair its successful presentation. 

Relative to typical phage display libraries, the complexities of cp libraries 
are low, containing perhaps hundreds to thousands of individuals. The 
demonstration library here contained about 50 distinct clones, as opposed to more 
typical phage libraries containing more than 10 s individuals (reviewed in Clackson, 
T. and J A. Wells, Tibtech 12: 173-184, 1994). 

37 randomly chosen selectants from round 1, and fewer 'from subsequent 
rounds (17, 11 and 14 were picked from rounds 2, 3 and 4, respectively) were 
chosen for sequence analysis. The identity of the MPO: cphG-GSFs identified in 
each round is shown in Figure 2. 

A total of 14 MPO: cphGCSF species were identified from the output of 
affinity selection (Figure 2). Most of the MPO: cphGCSF species identified from 
the library had new carboxy and amino termini in loop segments (9 of 14 
permuteins identified), rather than in clearly defined secondary structures {See 
Hill et al., 1993 for the hG-CSF structure). Five selectants had termini within 
helical domains of hG-CSF (MPO: cphG-CSFs 13/12, 19/18, 71/70, 123/122 and 
159/158). For three of these molecules (MPO: cphG-CSFs 13/12, 71/70 and 
123/122) their new ends lie at the outermost ends of helices, and therefore 
perturbation of secondary structure caused by these permuteins may be minimal. 
However, MPO: cphG-CSF 19/18 and MPO: cphG-CSF 159/158 have new termini 
well within helix 1 and helix 4 of hG-CSF, respectively. 
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These data parallel the observations of Graf and Schachman, who 
developed a limited DNase I digestion method for "random" permutagenesis~(Graf, 
R. and H. K Schachman, Proc Natl Acad Sci USA 93:11591-11596, 1996). They 
identified two permutein species of aspartate transcarbamoylase that introduced 
5 new amino and carboxy ends into secondary structural domains and that retained 

biological activity. In their work, the majority of permuteins introducing ends into 
secondary structures (5/7 identified) were significantly diminished in activity. In 
contrast, we found a several permuteins that introduced helical breaks retained 
activity (See Below). The method used by Graf and Schechman frequently 
10 introduces point mutations, small insertions and deletions into the permuted 

proteins, potentially complicating the analysis of the effects of permut agenesis. 

Example 3: Biological activity of MPO: cphG-CSFs selected from the cp 
phage library 

To obtain sufficient protein for analysis of the bioactivity of individual 
15 MPO molecules con tainin g cphG-CSFs, DNA segments containing individual 

affinity-selected MPO: cphGCSFs were subcloned into a mammalian expression 
vector, and expressed transiently in BHK cells as described above. 

The MPO: cphG-CSFs isolated from biopanning were all expressed 
transiently in mammalian cells and the amount of MPO: cphG-CSF in each 
20 supernatant was determined by sandwich hIl-3 ELISA (Oiins P.O. et al., «7. SioL 

Chem. 270: 23754-23760, 1995). The quantitated supernatants were then assayed 
for G-CSF receptor agonist activity in a Baf-3/G-CSF receptor assay (Figure 3, 
Table 4). 

All but one of the transiently expressed MPO: cphG-CSF proteins exhibited 
25 G-CSF activity equivalent to or slightly better than that of the parent MPO 

molecule, including those MPO: cphG-CSFs with new carboxy and amino ends 
within helixes. The permutein encoded by pMON16021 with a breakpoint between 
positions 48 and 49 did not exhibit activity in the G-CSF-dependent proliferation 
assay. These data suggest that most of the proteins isolated from the library are 
30 competent to bind the hG-CSF receptor and produce a proliferation signal. 

All references, patents, or applications cited herein are incorporated by 
reference in their entirety, as if written herein. 
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Tables 



Table 1: Circularly permuted proteins 



Protein 



Reference 



Enzymes 
T4 lysozyme 

dihydrofolate reductase 



ribonuclease Tl 



Bacillus p-glucanase 



aspartate transcarbamoylase 



phosphoribosyl anthranilate 

isomerase 
pepsin/pepsinogen 
glyceraldehyde-3-phosphate 

dehydrogenase 
ornithine decarboxylase 
yeast phosphogly cerate 

dehydrogenase 



Zhang et aL, Biochemistry 32:1-2311-12318 (1993); 
Zhang et al., Nature Struct Biol. 1:434-438 (1995) 
Buchwalder et al., Biochemistry 31:1621-1630 
(1994); 

Protasova et al., Prot. Eng. 7:1373-1377 (1995) 
Mullins et al., J. Am. Chem/Soc. 116:5529-5533 
(1994); 

Garrett et al., Protein Science 5:204-2 11 <1996) 
Hahn et al., Proc. Natl Acad. ScL U.SJL 91:10417- 

10421 (1994) 
Yang and Schachman, Proc. Natl. Acad. 

Transcarbamoylase ScL U.SA. 90:11980-11984 

(1993) 

Luger et al., Science 243:206-210 (1989); 
Luger et al., Prot. Eng. 3:249-258 (1990) 
Lin et al., Protein Science 4:159-166 (1995) 
Vignais et al., Protein Science 4:994-1000 1 1995) 

li & Coffino, Mol. Cell. Biol. 13:2377-2383 <1993) 
Ritco-Vonsovici et al., Biochemistry 34:16543- 
16551 (1995) 



Enzyme Inhibitor 

basic pancreatic trypsin inhibitor Goldenberg & Creighton, J. Mol. Biol. 165:407-413 

(1983) 



Cytokines 
interleukin-lp 
interleukin-4 



Horlick et al., Protein Eng. 5:427-431 (1992) 
Kreitman et ah, Cytokine 7:311-318 (1995) 



Tyrosine Kinase Recognition 
Domain 
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a-spectrin SH3 domain Viguera et al., J. Mol. Biol. 247:670-681 (1995) 

Transmembrane Protein 

omp A Koebnik & Kramer, J. Mol Biol. 250:617^626 

(1995) 

Chimeric Protein 

interleukin-4-Pscudomonos Kreitman et aL, Proc. Natl. Acad. Sci. I7.S.A 

exotoxin fusion molecule 91:6889-6893 (1994) 
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Table 2: Strains 



Designation Description or Genotype 



Reference/Source 



DH5a™ 



JM101 

(ATCC# 
33876) 

MON105 

(ATCC# 

55204) 



TGI 



F,p>u80 dZacZdeltaM15, 
delta«acZYA-argF)U169, efeoR, 
recAl, endAl, hsdRH (rk ,mk*), 
phoA, supE44, lambda-, tki-1, 
gyr A96, relAl 

delta (pro lac), supE, thi, 
F(traD36, proA'B*. lacT, 
ZacZdeltaM15) 

F, lambda-,IN (rrnD, rrnE)l, 
rpoD\ rpoH358 



life Technologies, Rockville, 
Maryland 



Yanisch-Perron et al., Gene, 33: 
103-119, 1985 

Obukowicz et sl.,Appl. and Envir. 
Micr., 58: 1511-1523, 1992 



MON208 W3110 rpoH358, tecl", ompT. Jtan Alan Easton 



deltadac-pro), supE, thi-1, 
JisdD5/F(*raD36,proA*B*, lacV, 
ZacZdeltaM15) 



Amersham Corp., Arlington 
Heights, Illinois 



W3110 



IN (rmD-rmE)l, rphl 



Lab collection 
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Table 3: Plasmids 



Plasmid 



SEQ Selectable 
ID Marker 
NO. 



Description 



Source 



pACYC177 



Kan R 
Amp R 



pMON15978 



Amp* 



pCANTAB 5E 



Amp R 



PMON16016 



Amp* 



Plasmid with multiple cloning 
sites and two selectable 
markers 



Chang, 
A.C.Y. and 
S.N. Cohen, 
J Bacteriol. 
134: 1141- 
1156, 1978 

Plasmid construct comprising a This work 

tandem repeat of the modified 

human granulocyte colony 

stimulating factor (hG-CSF 

with a serine for amino acid 17) 

gene joined by a sequence 

(GCCGG, termed a zero order 

linker), subcloned into the 

plasmid pACYC177 

Phage display vector containing Pharmacia 

lac promoter operably linked to Biotech, 

fd gene 3 signal sequence, a Piscataway, 

linker region, an E tag, and an NJ 

fd gene 3 structural gene all 

cloned into the vector backbone 

of pUC119 containing ColEl 

ori, the beta lactamase 

resistance gene, and an M13 

ori. 



Phagemid presentation vector 
pCANTAB 5E derivation 
containing inserted individual 
cphG-CSF gene such that it 
was expressed as a part of an 



This work 



RNRnnnin- <wn ooiriwrai ia> 



WO 00/18905 



30 



PCT/US99/20891 



MPO species, fused in turn to 
the amino terminus end of the 
phage genelll product. The 
first amino acid of the cphG- 
CSF domain is amino acid 1 of 
the parent, and the last amino 
acid is residue 174 of the 
parent. The zero order linker is 
attached at the carboxyl end of 
amino acid 174. 



pMON16017 Amp R 



Identical to pMON16016 except This work 

the first amino acid of the 

cphG-CSF domain is amino 

acid 3 of the parent, and the 

last amino acid is residue 2 of 

the parent. 



pMON16029 Amp R Identical to pMON16016 except This work 

the first amino acid of the 
cphG-CSF domain is amino 
acid 7 of the parent, and the 
last amino acid is residue 6 of 
the parent. 

pMON16030 ^pR Identical to P MON1«016 except This work 

the first amino acid of the 
cphG-CSF domain is amino 
acid 9 of the parent, and the 
last amino acid is residue 8 of 
the parent. 

pMON16018 Amp R Identical to pMON16016 except This work 

the first amino acid of the 
cphG-CSF domain is amino 
acid 11 of the parent, and the 
last amino acid is residue 10 of 
the parent. 
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pMON16019 



Amp 1 



pMON16031 



Amp R 



pMON16020 



Amp R 



pMON16032 



Amp R 



pMON16033 



Amp R 



pMON16034 



Amp R 



Identical to pMON1601€ except This work 

the first amino acid of the 

cphG-CSF domain is amino 

acid 13 of the parent, and the 

last amino acid is residue 12 of 

the parent. 

Identical to pMON16016 except This work 

the first amino acid of the 

cphG-CSF domain is amino 

acid 15 of the parent, and the 

last amino acid is residue 14 of 

the parent. 

Identical to pMON16016 except This work 

the first amino acid of the 

cphG-CSF domain is amino 

acid 19 of the parent, and the 

last amino acid is residue 18 of 

the parent. 

Identical to pMON16016 except This work 

the first amino acid of the 

cphG-CSF domain is amino 

acid 22 of the parent, and the 

last amino acid is residue 21 of 

the parent. 

Identical to pMONl601€ except This work 

the first amino acid of the 

cphG-CSF domain is amino 

acid 27 of the parent, and the 

last amino acid is residue 26 of 

the parent. 

Identical to pMONI6016 except This work 
the first amino acid of the 
cphG-CSF domain is amino 
acid 31 of the parent, and the 
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pMON16035 



Amp 1 



pMON16036 



Amp R 



pMON16037 



Amp R 



pMON16038 



Amp R 



pMON16039 



Amp R 



last amino acid is residue 30 of 
the parent. 

Identical to pMON16016 except This work 

the first amino acid of the 

cphG-CSF domain is amino 

acid 35 of the parent, and the 

last amino acid is residue 34 of 

the parent. 

Identical to pMON16016 except This work 

the first amino acid of the 

cphG-CSF domain is amino 

acid 37 of the parent, and the 

last amino acid is residue 36 of 

the parent. 

Identical to pMON16016 except This work 

the first amino acid of the 

cphG-CSF domain is amino 

acid 38 of the parent, and the 

last amino acid is residue 37 of 

the parent. 

Identical to pMON16016 except This work 

the first amino acid of the 

cphG-CSF domain is amino 

acid 39 of the parent, and the 

last amino acid is residue 38 of 

the parent. 

Identical to pMON16016 except This work 

the first amino acid of the 

cphG-CSF domain is amino 

acid 43 of the parent, and the 

last amino acid is residue 42 of 

the parent. 



pMON16040 Amp R 



Identical to pMON16016 except This work 
the first amino acid of the 
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pMON16041 



Amp 1 



pMON16022 



Amp R 



pMON16042 



Amp 1 



pMON16043 



Amp 1 



pMON16044 



Amp R 



cphG-CSF domain is amino 
acid 45 of the parent, and the 
last amino acid is residue 44 of 
the parent. 

Identical to pMON 160 16 except This work 

the first amino acid of the 

cphG-CSF domain is amino 

acid 47 of the parent, and the 

last amino acid is residue 46 of 

the parent. 

Identical to pMON16016 except This work 

the first amino acid of the 

cphG-CSF domain is amino 

acid 49 of the parent, and the 

last amino acid is residue 48 of 

the parent. 

Identical to pMON16016 except This work 

the first amino acid of the 

cphG-CSF domain is amino 

acid 51 of the parent, and the 

last amino acid is residue 50 of 

the parent. 

Identical to pMON16016 except This work 

the first amino acid of the 

cphG-CSF domain is amino 

acid 53 of the parent, and the 

last amino acid is residue '52 of 

the parent. 

Identical to pMON16016 except This work 

the first amino acid of the 

cphG-CSF domain is amino 

acid 56 of the parent, and the 

last amino acid is residue 55 of 

the parent. 
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pMON16023 Amp 11 Identical to pMON16016 except This work 

the first amino acid of the 
cphG-CSF domain is amino 
acid 60 of the parent, and the 
last amino acid is residue 59 of 
the parent. 

pMON!6045 Amp 11 Identical to pMON16016 except This work 

the first amino acid of the 
cphG-CSF domain is amino 
acid 64 of the parent, and the 
last amino acid is residue 63 of 
the parent. 



pMON16024 Amp R 



Identical to pMON16016 except This work 

the first amino acid of the 

cphG-CSF domain is amino 

acid 67 of the parent, and the 

last amino acid is residue 66 of 

the parent. 



pMON16046 Amp R Identical to pMON16016 except This work 

the first amino acid of the 
cphG-CSF domain is amino 
acid 69 of the parent, and the 
last amino acid is residue 68 of 
the parent. 

pMON16025 Amp R Identical to pMON16016 except This work 

the first amino acid of the 
cphG-CSF domain is amino 
acid 71 of the parent, and the 
last amino acid is residue 70 of 
the parent. 

pMON 16047 Amp R Identical to pMON16016 except This work 

the first amino acid of the 
cphG-CSF domain is amino 
acid 73 of the parent, and the 
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pMON16048 



Amp 1 



pMON16049 



Amp R 



P MON16050 



Amp R 



pMON16051 



Amp 11 



pMON16052 



Amp R 



pMON16053 



Amp R 



last amino acid is residue 72 of 
the parent. 

Identical to pMON16016 except This work 

the first amino acid of the 

cphG-CSF domain is amino 

acid 84 of the parent, and the 

last amino acid is residue 83 of 

the parent. 

Identical to pMON16016 except This work 

the first amino acid of the 

cphG-CSF domain is amino 

acid 98 of the parent, and the 

last amino acid is residue 97 of 

the parent. 

Identical to pMON16016 except This work 

the first amino acid of the 

cphG-CSF domain is amino 

acid 100 of the parent, and the 

last amino acid is residue 99 of 

the parent. 

Identical to pMON16016 except This work 

the first amino acid of the 

cphG-CSF domain is amino 

acid 102 of the parent, and the 

last amino acid is residue 101 

of the parent. 

Identical to pMON16016 except This work 

the first amino acid of the 

cphG-CSF domain is amino 

acid 112 of the parent, and the 

last amino acid is residue 111 

of the parent. 

Identical to pMON16016 except This work 
the first amino acid of the 



WO 00/18905 



pMON16026 AmpR 



P MON16027 Amp R 



pMON16054 AmpR 



pMON 16055 Amp R 



pMON16056 Amp R 
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cphG-CSF domain is amino 
acid 121 of the parent, and the 
last amino acid is residue 120 
of the parent. 

Identical to pMON16016 except This work 

the first amino acid of the 

cphG-CSF domain is amino 

acid 123 of the parent, and the 

last amino acid is residue 122 

of the parent. 

Identical to pMON16016 except This work 
the first amino acid of the 
cphG-CSF domain is amino 
acid 125 of the parent, and the 
last amino acid is residue 124 
of the parent. 

Identical to pMON16016 except 
the first amino acid of the 
cphG-CSF domain is amino 
acid 133 of the parent, and the 
last amino acid is residue 132 
of the parent. 

Identical to pMON16016 except 
the first amino acid of the 
cphG-CSF domain is amino 
acid 142 of the parent, and the 
last amino acid is residue 141 
of the parent. 

Identical to pMON16016 except This work 
the first amino acid of the 
cphG-CSF domain is amino 
acid 143 of the parent, and the 
last amino acid is residue 142 
of the parent. 



This work 



This work 
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pMON 16057 



Amp 1 



pMON16028 



Amp** 



pMON16058 



Amp 1 



pMON16059 



Amp R 



Identical to pMON16016 except This work 

the first amino acid of the 

cphG-CSF domain is amino 

acid 147 of the parent, and the 

last amino acid is residue 146 

of the parent. 

Identical to pMON16016 except This work 

the first amino acid of the 

cphG-CSF domain is amino 

acid 159 of the parent, and the 

last amino acid is residue 158 

of the parent. 

Identical to pMON16016 except This work 

the first amino acid of the 

cphG-CSF domain is amino 

acid 168 of the parent, and the 

last amino acid is residue 167 

of the parent. 

Identical to pMON16016 except This work 

the first amino acid of the 

cphG-CSF domain is amino 

acid 170 of the parent, and the 

last amino acid is residue 169 

of the parent. 
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Table 4: Analytical biopanning 



Before receptor* 


After receptor* 


Enrichment 


l/6.6xl0 4 


1/6.5X10* 1 


990-fold 



* Amp'VKan* resistant colonies 



Analytical biopanning shows that MPO molecules containing permuted hG- 
CSF domains can be presented and affinity selected in a hG-CSF receptor 
dependent fashion. A mixture of phagemids presenting MPO: cphG-CSF 38/37 
(ampicillin resistant) and M13k07 (kanamycin resistant) were exposed to BHK 
cells with or without the hG-CSF receptor on their surface, washed and eluted 
from the cell surface. Eluted phage were introduced into E. coli and the 
transfected cells were plated on media containing kanamycin or ampicillin. The 
ratio of ampicillin resistant to kanamycin resistant particles were determined prior 
to and following exposure to receptor by counting resistant colonies. 
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Table 5: Activity of selected permuteins 



Plasmid 


Permute in breakpoint 
in G-CSF amino acid 
sequence 


Activity in G-CSF- 
dependent proliferation 
assay 


P MON16017 


3/2 


+ 


pMON16018 


11/10 


+ 


P MON16019 


13/12 


+ 


pMON16020 


19/18 


+ 


P MON16021 


49/48 




P MON16022 


60/59 


+ 


pMON16023 


67/66 


+ 


PMON16024 


69/68 


+ 


pMON16025 


71/70 


+ 


P MON16026 


123/122 


+ 


pMON16027 


125/124 


+ 


pMON16028 


159/158 


+ 



0018905A1 IA> 
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Table 6: SEQ ID Number/SEQ ID Name Correlation 



1. 

2. 
3. 
4 . 

5. 
6. 
7. 

e. 

9. 

10. 

11. 

12. 
13- 
14. 
15. 
16. 
17. 
IB. 
19. 
20. 
21. 

22. 

23. 

24. 

25. 

26. 

27. 

28. 

29. 

30. 

31. 

32. 

33. 

34. 

35. 

36. 

37. 

38. 
39. 
40. 

41. 
42. 

.63. 



FCS1 

res 2 
re S3 

FCS4 

PCS 5 

FG56 

FGS7 

FGS8 

PGS9 

PGS10 

PCS11 

PGS12 

PCS 13 

FCS14 

PCS 15 

FCS16 

PCS 17 

Fcsie 

FCS19 

PC 520 

FGS21 

FGS22 

FGS23 

PGS24 

PGS25 

PCS26 

PCS27 

PGS28 

FCS29 

FG530 

PGS31 

PGS32 

PCS33 

FC534 

FCS3 5 

PGS36 

PCS 3 7 

PCS38 

PCS39 

PG540 

PGS41 

PGS42 

PG543 



CCCCCCC ACATC 
CCCCCCC ACATC 
CCCCCCC ACATG 
CCC CCCC ACATC 
ACATC 



44. PCS44 

45. PCS45 

46. POS46 

47. PCS47 
46. FCS4B 

49. PCS49 

50 . PGS50 

51. PCS51 

52. PCS52 

53. FCS53 

54. PCS 5 4 

55. PCS55 

56. PGS56 

57 . PC55" 

58. PG556 

59. FCS59 
f0 PGS60 

61. PCS 61 

62 . PCS 62 

63 . PCS 63 

64 . FCS64 
15 . FOS65 
66 . PCS 6 6 

67 . rss*- 

68. PCS68 

69. PCS69 

*?0 . PCS7-0 

- : rav?. 

Ti . PCS" 1 r 

72 . PCS73 

74 . FOS74 

75. POS75 

76. PCS76 

77. FCS77 

78. PCS7e 
75. PCS79 



CCCCCCC ACATC 
CCCCCCC ACATC 
CCCCC CC ACATC 
ACATC 
ACATC 
ACATC 
CCCCCCC ACATC 
CCCCCCC ACATC 
CCCCCCC ACATC 
CCCCCCC ACATC 
ACATC 
ACATC 
CCCCCCC ACATC 
CCC CCCC ACATC 
CCCCCCC ACATC 
CCCCCCC ACATC 
CCCCCCC ACATC 
CCCCCCC ACATC 
ACATC 
ACATC 
ACATC 
ACATC 
ACATC 
ACATC 
CCCCCCC ACATC 
CCCQCGC ACATC 
ACATC 
ACATC 
ACATC 
CCCCCCC ACATC 
C CCCCCC ACATC 
CCCCCCC ACATC 
CCCCCCC ACATC 
ACATC 
ACATC 
ACATC 
CCCCCCC ACATC 
CCCCCCC ACATC 
ACATC 
ACATC 
CCCCCCC ACATC 
CCCCCCC ACATC 
C CCCCCC ACATC 
CCCCCCC ACATC 
C CCCCCC ACATC 
CCCCCCC ACATC 



CCCCCCC 



ACATC 
ACATC 
ACATC 
ACATC 
ACATC 
ACATC 



CCCCCCC 
CCCCCCC 
CCCCCCC 
CCCCCCC 



CCCCCCC 



CCCCCCC 

rsccccc 

CCCCCCC 



rcccccc 



CCCCCCC 
CCCCCCC 
CCCCCCC 



ACATC 
ACATC 
ACATC 
ACATC 
ACATC 
ACATC 
ACATC 
ACATC 
ACATC 
ACATC 
ACATC 
ACATC 
ACATC 
ACATC 
ACATC 
ACATC 
ACATC 
ACATC 
ACATC 



TCT 


ACA 


CCA 








CCC 




TCT 


CCA 


TTC 


CCC 


CCT 


CCC 


ACC 


TCC 


TCT 


TTC 


GCC 


CCT 


CCC 


ACC 


TCC 


CTG 


TCT. 


CCC 


CCT 


CCC 


ACC 


TCC 


CTC 


CCC 


TCT 


CCT 


GCC 


ACC 


TCC 


CTG 


CCC 


CAC 


TCT 


CCC 


ACC 


TCC 


CTC 


CCC 


CAG 


ACC 


TCT 


ACC 


TCC 


CTC 


•CCC 


CAC 


ACC 


TTC 


TCT 


TCC 


CTC 


CCC 


CAC 


ACC 


TTC 


CTC 


TCT 


CTC 


CCC 


CAC 


ACC 


TTC 


CTC 


-CTC 


TCT 


CCC 


CAC 


ACC 


TTC 


. -CTC 


<CTC 


AAC 


TCT 


CAC 


ACC 


TTC 


CTC 


-CTC 


AAC 


TCT 


TCT 


ACC 


TTC 


■CTC 


-CTC 


AAC 


TCT 


TTA 


TCT 


TTC 


-CTC 


CTC 


AAC 


TCT 


TTA 


CAG 


TCT 


CTC 


CTC 


AAC 


TCT 


TTA 


GAG 


CAA 


TCT 


CTC 


AAG 


TCT 


TTA 


GAG 


CAA 


CTC 


TCT 


AAC 


TCT 


TTA 


GAG 


CAA 


CTO 


AGG 


TCT 


TCT 


TTA 


GAG 


CAA 


CTC 


AGG 


AAG 


TCT 


TTA 


GAG 


CAA 


CTG 


AGG 


AAG 


ATC 


TCT 


GAG 


■CAA 


CTC 


ACC 


AAC 


ATC 


CAG 


TCT 


CAA 


CTC 


ACC 


AAG 


ATC 


'CAG 


GCC 


TCT 


CTC 


ACC 


AAG 


ATC 


-CAG 


• CCC 


CAT 


TCT 


ACC 


AAG 


ATC 


CAC 


CCC 


CAT 


GCC 


TCT 


AAG 


ATC 


CAC 


CCC 


GAT 


CCC 


CCA 


TCT 


ATC 


CAC 


GCC 


CAT 


GCC 


CCA 


GCC 


TCT 


CAC 


GCC 


GAT 


GCC 


CCA 


CCC 


-CTC 


TCT 


CCC 


GAT 


GCC 


CCA 


CCC 


••CTC 


•CAC 


TCT 


GAT 


GCC 


CCA 


GCG 


CTC 


CAC 


GAG 


TCT 


CCC 


CCA 


GCC 


CTC 


CAG 


GAG 


AAC 


TCT 


CCA 


GCC 


CTC 


CAC 


GAG 


AAG 


CTC 


TCT 


GCG 


CTC 


CAC 


GAG 


AAC 


CTC 


TCT 


TCT 


CTC 


CAC 


CAC 


AAC 


"CTC 


TCT 


CCC 


TCT 


CAC 


GAG 


AAG 


•CTC 


TCT 


-CCC 


ACC 


TCT 


GAG 


AAG 


CTC 


TCT 


CCC 


ACC 


TAC 


TCT 


AAG 


'CTC 


TCT 


CCC 


ACC 


TAC 


AAC 


TCT 


CTC 


TCT 


GCC 


ACC 


TAC 


AAG 


CTC 


TCT 


TCT 


GCC 


ACC 


TAC 


AAG 


CTC 


TCC 


TCT 


CCC 


ACC 


TAC 


AAG 


CTC 


TCC 


CAC 


TCT 


ACC 


TAC 


AAC 


CTG 


TCC 


CAC 


CCC 


TCT 


TAC 


AAC 


"CTC 


TCC 


CAC 


~CCC 


GAG 


TCT 


AAC 


CTC 


TCC 


CAC 


CCC 


GAG 


CAC 


TCT 


CTC 


TCC 


CAC 


CCC 


GAG 


CAG 


CTG 


TCT 


TCC 


CAC 


■CCC 


GAG 


CAG 


CTC 


CTC 


TCT 


CAC 


CCC 


GAG 


CAC 


CTC 


CTG 


CTC 


TCT 


CCC 


GAG 


GAG 


.CTC 


CTC 


■CTG 


-CTC 


TCT 


GAG 


CAC 


-CTC 


CTC 


•CTC 


CTC 


GGA 


TCT 


GAG 


CTC 


CTC 


-CTC 


-CTC 


GGA 


CAC 


TCT 


CTC 


CTC . 


-CTC 


CTC 


CCA 


-CAC 


TCT 


TCT 


CTC 


CTC 


-CTC 


GGA 


CAC 


TCT 


CTC 


TCT 


CTC 


CTC 


GGA 


•CAC 


TCT 


CTG 


GCC 


TCT 


CTC 


CCA 


CAC 


TCT 


CTC 


•CCC 


ATC 


TCT 


CCA 


CAC 


TCT 


CTC 


GCC 


ATC 


CCC 


TCT 


CAC 


TCT 


CTC 


CCC 


ATC 


CCC 


TCG 


TCT 


TCT 


CTC 


GCC 


ATC 


-CCC 


TCG 


CCT 


TCT 


CTC 


CSC 


ATC 


•CCC 


TCG 


CCT 


-CCC 


TCT 


GCC 


ATC 


CCC 


TCG 


CCT 


CCC 


CTC 


TCT 


ATC 


CCC 


TCC 


CCT 


CCC 


CTC 


ACC 


TCT 


CCC 


TCG 


CCT 


CCC 


•CTC 


ACC 


TCC 


TCT 


TCC 


CCT 


CCC 


CTC 


ACC 


TCC 


TCC 


TCT 


CCT 


CCC 


CTC 


ACC 


TCC 


TCC 


CCC 


TCT 


CCC 


CTC 


ACC 


TCC 


TCC 


CCC 


ACC 


TCT 


CTC 


ACC 


TCC 


TCC 


CCC 


ACC 


CAC 


TCT 


ACC 


TCC 


TCC 


CCC 


ACC 


CAG 


CCC 


TCT 


TCC 


TCC 


CCC 


ACC 


CAC 


GCC 


•CTC 


TCT 


TCC 


CCC 


ACC 


CAC 


CCC 


CTC 


CAG 


TCT 


CCC 


ACC 


CAC 


CCC 


CTG 


CAG 


CTC 


TCT 


ACC 


CAC 


CCC 


-CTC 


-CAC 


«CTC 


CCA 


TCT 


CAC 


CCC 


CTC 


CAC 


CTG 


CCA 


CCC 


TCT 


GCC 


CTC 


CAC 


CTC 


CCA 


CCC 


TCC 


TCT 


CTC 


CAC 


CTC 


CCA 


CCC 


TCC 


TTC 


TCT 


CAC 


CTC 


CCA 


CCC 


TCC 


TTC 


ACC 


TCT 


CTC 


CCA 


CCC 


TCC 


TTC 


ACC 


CAA 


TCT 


CCA 


CCC 


TCC 


TTC 


ACC 


•CAA 


-CTC 


TCT 


GCC 


TCC 


TTC 


ACC 


CAA 


CTC . 


■CAT 


TCT 


TCC 




ACC 


CAA 


CTC 


CAT 


ACC 




TTC 




CAA 


CTC 


CAT 


ACC 


CGC. 






7AA 


CTC 


CAT 


ACC 


GCC 


•CTT 


TCT 


CAA 




-CAT 


ACC 


CCC 




TTC 


TCT 




CAT 


ACC 


GCC 






CTC 




CAT 


ACC 


"C3C 


CTT 


TTC 




TAC 
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6 C 


PCS80 




ACATC 


TCT 


AGC 


GGC 


CTT 






TAC 


-CAG 




FGS81 


CGCGCGC 


ACATC 


TCT 


GGC 


CTT 


TTC 




TAC 


CAG 


'GGG 


6" 


PCS 8 2 


CGCGCGC 


ACATG 


TCT 


CTT 


TTC 


CTC 


TAC 


CAG 


GGG 


CTC 


83 


FCS83 


CGCGCGC 


ACATC 


TCT 


TTC 


CTC 


TAC 


CAG 


GCC 


CTC 


CTG 


84 


FG584 




ACATC 


TCT 


CTC 


TAC 


CAG 


GGG 


CTC 


CTG 


CAG 




PGS85 


CGCGCGC 


ACATC 


TCT 


TAC 


CAG 


GGG 


CTC 


CTG 


CAG 


GCC 


86 


FGS86 


CGCGCGC 


ACATC 


TCT 


CAG 


GGG 


CTC 


CTG 


■CAG 


GCC 


CTC 


B7 . 


PGS87 




ACATC 


TCT 


GGG 


CTC 


CTG 


CAG 


CCC 


CTG 


GAA 


68. 


PCS88 


CGCGCGC 


ACATC 


TCT 


-CTC 


CTG 


CAG 


GCC 


CTG 


GAA 


GGG 


89 • 






ACATC 


TCT 


CTG 


CAG 


GCC 


CTG 


GAA 


-GGG 


ATA 


90 


PCS90 


CGCGCGC 


ACATC 


TCT 


CAG 


GCC 




GAA 


GGG 


ATA 


TCC 


91 . 


FGS91 


CGCGCGC 


ACATC 


TCT 


ccc 


CTG 


GAA 


GGG 


ATA 


TCC 


CCC 


92 


FS592 


CGCGCGC 


ACATC 


TCT 


CTG 


GAA 


GGG 


ATA 


TCC 


CCC 


GAG 


93 . 


PCS 9 3 


CGCGCGC 


ACATC 


TCT 


GAA 


GGG 


ATA 




CCC 


GAG 


TTG 


94. 


PGS94 


CGCGCGC 


ACATC 




GGG 


ATA 


TCC 


CCC 


GAG 


TTG 


GGT 


95 . 








TCT 


ATA 


TCC 


CCC 


GAG 


TTG 


GCT 


CCC 


96 . 


FG596 


CGCGCGC 


ACATC 


TCT 


TCC 


CCC 


GAG 


TTG 


GGT 


CCC 


ACC 


97 . 


PGS97 


CGCGCGC 


ACATC 


TCT 


CCC 


GAG 


TTG 


GGT 


"CCC " 


ACC 




98 . 








TCT 


GAG 






CCC 


ACC 


TTG 


GAC 


99 • 






ACATC 


TCT 


TTG 


GGT 


CCC 


ACC 




GAC 


ACA 


100 . 


FBSIOO 


CGCGCGC 








CCC 


ACC 


TTG 


GAC 


ACA 


CTG 


101 . 


PCS101 




ATATC 


TCT 


CCC 


ACC 




GAC 


ACA 


CTG 


CAG 


102 . 


PGS102 


CGCGCGC 


ACATC 




ACC 




GAC 


ACA 


CTG 




CTG 


103 . 


PG5103 


CGCGCGC 


ACATC 


TCT 




GAC 








.CTC 




104 . 


PGS104 


CGCGCGC 


ACATC 




GAC 


ACA 


CTC 


CAG 




GAC 




105 . 


PGS105 


CGCGCGC 


ACATC 


TCT 


ACA 


CTG 








GTC 


GCC 


106 . 


FCS106 


CGCGCGC 


ACATC 


TCT 


CTC 


CAG 




GAC 

• 


CTC 


GCC 


CA< " 


101 . 


PGS107 


CGCGCGC 


ACATC 


TCT 


-CAG 


CTG 


GAC 


CTC 


CCC 


GAC 




108. 


PCS108 


CGCGCGC 


ACATC 


TCT 


CTC 


GAC 


GTC 


GCC 


GAC 


TTT 


GCC 


109 . 


FGS109 


CGCGCGC 


ACATC 


TCT 


GAC 


CTC 


GCC 


GAC 


TTT 


GCC 


ACC 


110. 


PGS110 


CGCGCGC 


ACATC 


TCT 


CTC . 


GCC 


GAC 


2 . 2 


GCC 


ACC 


ACC 


111. 


PGS111 


CGCGCGC 


ACATC 


TCT 


GCC 


GAC 


TTT 


•ccc 


ACC 


ACC 


ATC 


112 . 


FGS112 


CGCGCGC 


ACATC 


TCT 


GAC 


TTT 


GCC 




ACC 


ATC 


TGG 


113 . 


PCS 113 


CGCGCGC 


ACATC 


TCT 


TTT 


GCC 


ACC 


ACw 


ATC 


TGG 


CAG 


114 . 


PGS114 


CGCGCGC 


ACATC 


TCT 


GCC 


ACC 


ACC 


ATC 


TGG 


CAG 


CAG 


115. 


FGS115 


CGCGCGC 


ACATC 


TCT 


ACC 


ACC 


ATC 


TGG 


CAG 


CAG 


ATG 


116 . 


PGS116 


CGCGCGC 


ACATC 


TCT 


ACC 


ATC 


TOG 


CAG 


CAG 


ATC 


GAA 


117 . 


PGS117 


CGCGCGC 


ACATC 


TCT 


ATC 


TCC 


CAG 


CAG 


ATG 


GAA 


GAA 


118 . 


FCS118 


CGCGCGC 


ACATC 


TCT 


TOO 


CAG 


CAG 


ATG 


GAA 


GAA 


CTG 


119 . 


PGS119 


CGCGCGC 


ACATC 


TCT 


CAG 


CAG 


ATG 


GAA 


GAA 


CTG 


GGA 


120 . 


FGS120 


CGCGCGC 


ACATC 


TCT 


CAG 


ATC 


GAA 


GAA 


CTG 


GGA 


ATG 


121. 


FGS121 


CGCGCGC 


ACATC 


TCT 


ATC 


GAA 


GAA 


••CTG 


'GGA 


ATC 


CCC 


122 . 


FGS122 


CGCGCGC 


ACATC 


TCT 


GAA 


GAA 


CTG 


GGA 


ATG 


GCC 


.CCT 


123 . 


FQS123 


CGCGCGC 


ACATC 


TCT 


GAA 


CTG 


GGA 


ATG 


GCC 


CCT 


CCC 


124 . 


PCS 12 4 


CGCGCGC 


ACATC 


TCT • 


CTG 


GGA 




GCC 


CCT 


GCC 


CTG 


125 . 


PGS125 


CGCGCGC 


ACATC 


TCT 


GGA 


ATC 


CCC 




GCC 


CTG 


CAG 


126 . 


PCS 126 


CGCGCGC 


ACATC 


TCT 


ATC 


CCC 


CCT 




CTG 


CAG 


CCC 


127 . 


FGS127 


CGCGCGC 


ACATC 


TCT 


GCC 


CCT 


GCC 


CTG 


CAG 


CCC 


ACC 


128 . 


PGS128 




ACATC 


TCT 


CCT 


CCC 












129 . 


PGS129 


^ CCCeSC 


ACATC 


TCT 


GCC 


CTC 


CAG 




ACC 
ACC 


CAG 




130 . 


PGS130 


CGCGCGC 


ACATC 


TCT 


CTG 


CAG 


CCC 






GGT 


CCC 


131 . 


PGS131 


CGCGCGC 


ACATC 


TCT 


CAG 


CCC 


ACC 


CAG 


GGT 


GCC 


ATG 


.132 . 


FGS132 


CGCGCGC 


ACATG 


TCT 


CCC 


ACC 


CAG 


GGT 


CCC 


ATG 


• CCG 


133 . 


FGS13 3 


CGCGCGC 


ACATG 


TCT 


ACC 


CAG 


GGT 






CCG 




134 . 


' FCS134 


CGCGCGC 


ACATG 


TCT 


CAG 


GGT 
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166 . PC5166 CGCGCCC ACATG TCT CGC 

16*7. FGS167 CGCGCCC ACATC TCT CTT 

168 . PCS168 CGC G CCC ACATC TCT CTA 

169. FCS169A CGCGCCC ACATC TCT CGC 
1*10. FGS170A CGCGCCC ACATC TCT CAC 

171 . FGS171A CGCGCCC ACATC TCT CTT 

172. FCS172A CGCGCCC ACATC TCT CCC 

173 . FGS173A . CGCGCCC ACATC TCT CAC 

174. FCS174A CGCGCCC ACATC TCT CCC 

175. FGS169B CGCGCCC ACATC TCT CCC 

176. FGS1708 CGCGCCC ACATC TCT CAC 

177. FGS171B CGC GCCC ACATC TCT CTT 

178 . FGS172B C G C GC C C ACATC TCT CCC 

179. rCSl73B CGCGCCC ACATC TCT CAG 

180. FGS174B CGCGCCC ACATC TCT CCC 

181. FGS169C CGCGCCC ACATC TCT CGC 

182. FGS170C CGCGCCC ACATC TCT CAC 

183. FGS171C CGCGCCC ACATC TCT CTT 

184. FG5172C CGCGCGC ACATC TCT CCC 
185- FGS173C CGCGCC C ACATG TCT CAC 

186. rGS174C C S CGC CC ACATG TCT CCC 

187 . RGSOA TATATAT CCGGCCGC AGC CAT 

188 . AGSOB TATATAT C CGGC C GC AGC CAT 

189. RGSOC TATATAT CCGGCCGC AGC CAT 

190. RGS1A TATATAT CCGGCCGC TCT AGC 

191 . RGS1B TATATAT CCGGCCGC TCT AGC 

192. RGS1C TATATAT GCGGCCGC TCT AGC 

193. RGS2A TATATAT G C G GC C GC TCG TCT 

194 . RGS2B TATATAT CCGGCCGC TCG TCT 

195. RGS2C TATATAT CCGGCCGC TCG TCT 

196 . RGS3A TATATAT CCGGCCGC CAA TCG 

197. RGS3B TATATAT GCGGCCGC CAA TCG 

198. RGS3C TATATAT CCGGCCGC CAA TCG 

199. RGS4 TATATAT GCGGCCGC GCC CAA 

200. RGS5 TATATAT CCGGCCGC AGC GCC 

201. RG56 TATATAT GCGGCCGC GGC AGC 

202. RCS7 TATATAT GCGGCCGC OCT GGC 

203 . RGS8 TATATAT CCGGCCGC GGA GCT 

204. RGS9 TATATAT GCG GC CGC CAG GGA 

205 . RGS10 TATATAT GCGGCCGC GCC CAG 

206. RGS11 TATATAT GCGGCCGC CTC GGC 

207. RGS12 TATATAT GCGGCCGC GCT CTC 

208. RGS13 TATATAT GCGGCCGC GAA GCT 

209. RGS14 TATATAT GCGGCCGC CAC GAA 

210 . RCS15 TATATAT GCGGCCGC GAG CAC 

211. RCS16 TATATAT GCGGCCGC CTT GAG 

212 . RGS17 TATATAT GCGGCCGC AGA CTT 

213 . RCS18 TATATAT GCGGCCGC TAA AGA 

214. RG519 TATATAT GCGGCCGC CTC TAA 

215 . RGS20 TATATAT GCGGCCGC TTC CTC 

216. RGS21 TATATAT GCGGCCGC CAC TTG 

217. RGS22 TATATAT GCGGCCGC CCT CAC 

218 . RG523 TATATAT CCGGCCGC CTT CCT 

219. RCS24 TATATAT GCGGCCGC CAT CTT 

220. RGS25 TATATAT GCGGCC G C CTC GAT 

221. RGS26 TATATAT CCGGCCGC GCC CTC 

222. RCS27 TATATAT GCGGCCGC ATC GCC 

223. RGS28 TATATAT GCGGCCGC GCC ATC 

224. RGS29 TATATAT GCGGCCGC TCC CCC 

225. RGS30 TATATAT GCGGCCGC CCC TCC 

226. RG531 TATATAT GCG GC C G C GAG CCC 

227. RGS32 TATATAT GCGGCCGC CTC GAG 

228. RGS33 TATATAT GCGGCCGC CTC CTC 

229. ROS34 TATATAT GCGGCCGC CTT CTC 

230. RCS35 TATATAT CCGGCCGC CAC CTT 

231 . RCS36 TATATAT CCGGCCGC ACA CAC 

232. RCS37 TATATAT GCGGCCGC GCC ACA 

233 . RC538 TATATAT GCGGCCGC CCT GGC 

234. RCS39 TATATAT GCGGCCGC GTA GCT 

235 . RGS40 TATATAT GCGGCCGC CTT GTA 

236. RGS41 TATATAT CCG GC C GC CAG CTT 

237. KSSA2 TATATAT GCGGCCGC CCA CAC 
! . fCS4J TATATAT GCGGCCGC CTC CCA 

239. RCS44 TATATAT CCGGCCGC GCC CTC 

240. RCS45 TATATAT GCGGCCGC CTC GCC 
Zil. RC546 TATATAT CCGGC CG C CTC CTC 
til. ACS 4 7 TATATAT GCOGCCCC CAG CTC 
2A2. R3S48 TATATAT GCGGCCGC CAC CAG 

244 . RGS49 TATATAT C CGGC C GC CAG CAC 

245 . ROS50 TATATAT GC GGC C G C GAG CAG 

246. ROSSI TATATAT GC GGC C G C TCC GAG 

247. RCS52 TATATAT CCGGCCGC CTC TCC 

248. MS53 TATATAT CCGGCCGC AGA CTC 

249. RCS54 TATATAT CCG GC C G C CAC ACA 

250. RCS55 TATATAT CCGGCCGC GCC CAG 

251. F.CS56 TATATAT GCGGCCGC CAT CCC 
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252. RGS57 TXTATAT UU^UCCGC CGC CAT 

253. RGS58 TATATAT U CGOCCCC ^CA GOG 
254 1 RGS59 TATATAT GCCGCCGC AGC CCA 
255. RCS6 0 TATATAT CCGGCCGC GGG ACC 

256 RCS61 TATATAT CCCGCCCC CAC COG 

257 RGS62 TATATAT GCGCCCGC OCT CAG 

258 RCS63 TATATAT GCGCCCGC GGA GCT 
259. RGS64 TATATAT CCGGCCGC GCA GGA 
260 RGS65 TATATAT CCGGCCGC GGG GCA 

261. RGS66 TATATAT CCCCCCGC GCT GGG 

262. RGS67 TATATAT CCGGCCGC CTG GCT 

263 RGS68 TATATAT CCGGCCGC GGC CTG 

264 RGS69 TATATAT CCGGCCGC CAG GGC 

265. RGS70 TATATAT CCCCCCC ^ ™ « 

266. RGS71 TATATAT 



CAG CTG 

267. RGS72 TATATAT TGC CAG 

268 . RGS73 TATATAT GCOGCCGC GCC TGC 



269. RGS74 TATATAT 



GCA CCC 



270. RGS75 TATATAT CCGGCCGC CAA GCA 

271. ROS76 TATATAT CC GG CCGC GCT CAA 

272. RGS77 TATATAT CCGGCCGC GTT GCT 

273. RGS78 TATATAT CCGGCCGC GAG GTT 

274. RGS79 TATATAT GCGCCCGC ATG CAG 

275 . RCS80 TATATAT GCGGCCCC GCT ATG 

276. RGS81 TATATAT CCGGCCGC GCC GCT 

277. RGS82 TATATAT GC GCC C G C AAG GCC 

278. RGS83 TATATAT GCGGCCCC GAA AAG 

279. RGS84 TATATAT C CGGCCGC GAG GAA 

280. RGS85 TATATAT GCGGCCCC GTA CAG 

281. RGS86 TATATAT GCGGCCCC CTG GTA 

282. RGS67 TATATAT GCGCCCGC CCC CTG 

283 . RGS88 TATATAT CCGGCCGC CAG CCC 

284. ROS89 TATATAT GCGGCCCC CAG GAG 

285. RGS90 TATATAT CCGOCCGC CTG CAG 

286. RCS91 TATATAT GCGCCCGC GCC CTG 

287 . RCS92 TATATAT CCGGCCGC CAG GGC 

288 . RGS93 TATATAT GCGGCCCC TTC CAG 

289. RGS94 TATATAT CCGGCCGC CCC TTC 

290. ROS95 TATATAT CCGGCCGC TAT CCC 

291. RGS96 TATATAT CCGGCCGC GCA TAT 

292 . RGS97 TATATAT C CGGC C G C GGG GGA 

293 . ROS98 TATATAT G C UG CC GC CTC GGG 

294. RGS99 TATATAT CCGGCCGC CAA CTC 

295. ROS100 TATATAT GCGCCCGC ACC CAA 

296. RGS101 TATATAT CCG GC C G C GGG ACC 

297. RGS102 TATATAT GCOGCCGC GOT GGG 

298. RG5103 TATATAT C^ UGC C GC CAA OCT 

299 . RGS104 TATATAT GCGCCCGC CTC CAA 

300. RGS105 TATATAT GCGCCCGC TCT CTC 

301. RGS106 TATATAT OCGGCCCC CAG TOT 

302. RCS107 TATATAT GCOGCCGC CTC CAG 

303. RCS10B TATATAT GCGCCCGC CAG CTG 

304. RGS109 TATATAT GCGGCCCC CTC CAG 

305. RCS110 TATATAT CC GGC C G C GAC GTC 

306 . RGS111 TATATAT CCGGCCGC CGC CAC 

307 . RGS112 TATATAT GCOGCCGC CTC GGC 

308. RGS113 TATATAT GCGGCCCC AAA GTC 

309. RCS114 TATATAT GCGOCCGC GGC AAA 

310. RGS115 TATATAT GC GOC CGC GOT GGC 

311 . RGS116 TATATAT GCGOCCGC GOT OCT 

312. RGS117 TATATAT CCCCCCGC GAT COT 

313 . RCS118 TATATAT GCGGCCCC CCA CAT 

314. RGS119 TATATAT GCOGCCGC CTG CCA 

315. RGS120 TATATAT GCGCCCGC CTG CTC 

316 . RCS121 TATATAT GCGOCCGC CAT CTG 

317. RGS122 TATATAT GCOGCCGC TTC CAT 

318. RG5123 TATATAT OCGGCCCC TTC TTC 

319. RCS124 TATATAT GCGCCCGC CAG TTC 

320 . RC5125 TATATAT CCGGCCGC TCC CAG 

321. RGS126 TATATAT GC GCC C G C CAT TCC 

322 . RGS127 TATATAT GCGCCCGC GGC CAT 

323. RGS128 TATATAT GCOGCCGC AGO GGC 

324. RGS129 TATATAT GCGCCCGC GGC AGG 
325.. RGS130 TATATAT GCGCCCGC CAG GGC 

326 . RGS131 TATATAT G C GCC CGC CTG CAG 

327. RCS132 TATATAT GCOGCCGC GGG CTG 
22? . RGS133 TATATAT CCCCCCGC GOT GGG 

329 . RGS134 TATATAT GCGCCCGC CTC OCT 

330. RGS135 TATATAT GCGGCCCC ACC CTC 

331. RG5136 TATATAT CCCGCCCC GGC ACC 

332. RGS137 TATATAT G CGG CCCC CAT GGC 

333. RGS12* TATATAT GCGGCCCC CGC CAT 
. 334. ROS13S TATATAT GCG O CC GC CGC CCG 

335. P.CSi40 TATATAT GCGGCCCC GAA CGC 

l it . RCS141 TATATAT GCGGCCCC GGC GAA 

337. RC5142 TATATAT -CCCCCCGC ASA GCC 
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TATATAT 

TATATAT 

TATATAT 

TATATAT 

TATATAT 

TATATAT 

TATATAT 

TATATAT . 

TATATAT 

TATATAT 

TATATAT 

TATATAT 

TATATAT 

TATATAT 

TATATAT 

TATATAT 

TATATAT 

TATATAT 

TATATAT 

TATATAT 

TATATAT 

TATATAT 

TATATAT 

TATATAT 

TATATAT 

TATATAT 



GCGGCCCC 
CCGGCCCC 

ce ccc e cc 

GCGGCCCC 
GCGGCCCC 
GCGGCCCC 
GCGGCCCC 
CCGGCCCC 



GCGGCCCC 
G CGG CCCC 
GCGGCCCC 
GCGGCCCC 
GCGGCCCC 



GCGGCCCC 
GCGGCCCC 
GCGGCCCC 
GCGGCCCC 



GCGGCCCC 



GCGGCCCC 
GCGGCCCC 
GCGGCCGC 
GCGGCCCC 
GCGGCCGC 
GCGGCCCC 
GCGGCCGC 
GCGGCCCC 



AGC 

GAA 

CTG 

CCG 

CCG 

TGC 

TCC 

CCC 

GAC 

CAG 

AAC 

AGC 

GCT 

ATG 

CAG 

CTG 

GCT 

GAA 

CAG 

CTC 

CAC 

CCA 

GTA 

GCC 

AAC 

TAG 

GCC 

CTG 

AAG 

CCC 

CTG 



AGA 
AGC 
GAA 
CTG 
CCC 
CCG 
TGC 
TCC 
CCC 
GAC 
CAG 
AAC 
AGC 
GCT 
ATG 
CAG 
■CTC 
CCT 
GAA 
CAC 
CTC 
GAC 
CCA 
GTA 
GCG 
AAC 
TAG 
GCG 
GTG 
AAG 



GGC 

AGC 

GAA 

CTG 

GCG 

CCG 

TCC 

TCC 

CCC 

CAC 

CAC 

AAC 

AGC 

GCT 

ATG 

CAG 

CTG 

CCT 

GAA 

CAG 

CTC 

CAC 

CGA 

GTA 

GCG 

AAC 

TAG 

CCG 

GTG 

AAG 



GAA 
GGC 
AGA 
AGC 
GAA 
-CTG 
GCG 
CCG 
TGC 
TCC 
'CCC 
CAC 
CAG 
AAC 
AGC 
GCT 
ATG 
CAG 
CTC 
GCT 
GAA 
CAG 
CTC 
CAC 
CGA 
GTA 
CCG 
AAC 
TAG 
CCG 
GTG 



GAA 

GGC 

AGA 

AGC 

GAA 

CTG 

CCG 

CCG 

TCC 

TCC 

CCC 

GAC 

CAG 

AAC 

AGC 

GCT 

ATG 

CAG. 

CTG 

GCT 

GAA 

CAG 

CTC 

CAC 

CGA 

GTA 

GCG 

AAC 

TAG 

GCG 



CGC 
CCC 
GAA 
GGC 
AGA 



CTC 
CCC 
CCG 
TGC 
TCC 
CCC 
GAC 
CAG 
AAC 
AGC 
CCT 
ATG 
CAG 
CTC 



CAG 
CTC 



CTA 
GCG 
AAC 

TAG 



CAT 
CCG 
GGC 
GAA 
GGC 



GAA 
CTC 
GCG 
CCC 
TCC 
TCC 
CCC 
GAC 
CAC 
AAC 
AGC 
CCT 
ATG 
CAG 
CTC 
CCT 
GAA 
CAC 
CTC 
CAC 
CGA 
GTA 
CCG 
AAC 
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CLAIMS 



What is claimed is: 



1. A method for making a biologically-active circularly-permuted protein of 
the formula CM-^-N 1 , derived from a parent protein of the formula N^C 1 , 
5 wherein 

C 1 is comprised of a segment derived from the carboxy portion of 
said parent protein; 

N 1 is comprised of a segment derived from the amino terminal 
portion of said parent protein; and 

10 L 1 is a chemical bond or a linker, linking C 1 to the amino terminus 

of L 1 and carboxy terminus of L 1 to the amino terminus of N 1 ; 

comprising the steps of: 

(a) making a series of circularly-permuted genes; 

(b) inserting said circularly-permuted genes into a display vector; 

15 (c) expressing said circularly-permuted genes such that the proteins 

encoded by said genes are presented on the surface of the display 
vector; 

(d) generating a library of display vectors presenting the expressed 
circularly permuted protein; 

20 (e) affinity-selecting the presenting display vectors with a target 

protein that can bind a biologically-active circularly-permuted 
protein; 

(f) isolating and analyzing clones of selected display vectors to identify 
the presented circularly-permuted protein. 

25 2. The method of claim 1 wherein the method of making a series of 

circularly-permuted genes is selected from the group consisting of 
making a tandemly-repeated intermediate, total synthesis of a 
synthetic gene, assembly of a gene from synthetic oligonucleotides, 
DNA amplification, and limited digestion of a circular 
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intermediate. 

The method of claim 1 wherein said display vector is selected from 
the group consisting of bacteriophage display vectors, bacteria, and 
baculovirus vectors. 

4. The method of claim 3, wherein said presentation vector is 
a bacteriophage* 

5. The method of claim 4, wherein said presentation 
vector is bacteriophage M13. 

6. The method of claim 5, wherein said 
presentation vector is a bacteriophage M13 
gene III vector. 

The method of claim 1 wherein said method of making a series of 
circularly permuted genes is a method of making a tandem repeat 
intermediate. 

8 The method of claim 7 wherein said circularly-permuted 

genes are amplified from the repeat by gene amplification. 

The method of claim 1 wherein said method of affinity selection 
comprises the steps consisting of: 

(a) binding said presentation display vectors to a target 
protein; 

(b) eluting said display vectors; 

(c) amplifying said display vectors; and 

(d) biopanning a pool of said amplified display vectors. 

The method of claim 1 wherein L 1 is a linear peptide linker. 

The method of claim 1 wherein said the DNA sequence encoding 
said linker L 1 is selected from the group consisting of SEQ ID NO: 
1 through SEQ ID NO: 368. 

The method of claim 1 wherein the length of the C 1 in said 
permutein is shorter than the length of C l in said parent protein. 
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13. The method of claim 1 wherein the length of the N 1 in said 
permutein is shorter than the length of N 1 in said parent protein. 

14. A circularly-permuted protein prepared by the method of claim 1. 

15. A circularly-permuted protein of claim 14 comprising the G- 
CSF receptor agonist domain of a species of mylepoietin 
(MPO). 
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Figure 1A 
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Figure IB 

Screening a display library 
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Figure 2 



Permuted proteins in scanning permutagenesis library 
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Figure 3 

BHK MPO samples in the BAF/G-CSF 
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SEQUST.TXT 



SEQUENCE LISTING 

<110> Lee, Stephen C 

<120> Method of Producing Permuteins toy 
Scanning Permut agenesis 

<130> C_3041 

<140> 60/ 

<141> 1998-09-25 

<160> 368 

<170> FastSEQ for Windows Version 3.0 

<210> 1 
<21i> 36 
<212> DNA 
<213> Synthetic 

<400> 1 

cgcgcgcaca tgtctacacc attgggccct gccagc 

<210> 2 
<211> 36 
<212> DMA 
<213> Synthetic 

<400> 2 

cgcgcgcaca tgtctccatt gggccctgcc agctcc 

<210> 3 
<2X1> 36 
<212> ENA 
<213> Synthetic 

<400> 3 

cgcgcgcaca tgtctttggg ccctgccagc tccctg 

<210> 4 

<211> 36 

<212> UNA 

<213> Synthetic 

<400> 4 

cgcgcgcaca tgtctggccc tgccagctcc ctgccc 

<2I0> 5 
<21X> 3€ 
<212> SNA 
<213> Synthetic 

<400> 5 

cgcgcgcaca tgtctcctgc cagctccctg ccccag 

<210> 6 

<211> 36 

<212> UNA 

<2 13 > Synthetic 



<400> 6 

cgcgcgcaca tgtctgccag ctocctgccc -cagagc 



36 
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<210> 7 
<2X1> 36 
<212> DNA 
<213> Synthetic 

<400> 7 

cgcgcgcaca tgtctagctc cctgccccag agcttc 

<210> 8 
<211> 36 
<212> ENA 
<213> Synthetic 



<400> 8 

cgcgcgcaca tgtcttccct gccccagagc ttcctg 

<210> 9 
<211> 36 

<2i2> dna 
<213> Synthetic 

<400> 9 

cgcgcgcaca tgtctctgcc ccagagcttc ctgctc 

<210> 10 
<211> 36 
<212> DMA 
<213> Synthetic 

<400> 10 

cgcgcgcaca tgtctcccca gagcttcctg ctcaag 



<210> 11 
<211> 36 
<212> DNA 
<213> Synthetic 



<400> 11 

cgcgcgcaca tgtctcagag cttcctgctc aagtct 36 

<210> 12 
<211> 36 
<212> DMA 
<213> Synthetic 

<400> 12 

cgcgcgcaca tgtctagctt cctgctcaag tcttta 36 

<210> 13 
<211> 36 
<212> DMA 
<213> Synthetic 



<400> 13 

cgcgcgcaca tgtctttcct gctcaagtct ttagag 36 

<210> 14 
<211> 36 
<212> DNA 
<212> Synthetic 

<400> 14 

cgcgcgcaca tgtctctgct caagtcttta gagcaa 36 

<213> 15 
<211> 36 
<212> DMA 
<213> Synthetic 
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<400> 15 

cgcgcgcaca tgtctctcaa gtctttagag caagtg 36 

<210> 16 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 16 

cgcgcgcaca tgtctaagtc tttagagcaa gtgagg 36 

<210> 17 

<211> 36 

<212> DNA 

<213> Synthetic 

<400> 17 

cgcgcgcaca tgtcttcttt agagcaagtg aggaag 36 

<210> 18 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 18 

cgcgcgcaca tgtctttaga gcaagtgagg aagatc 36 

<210> 19 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 19 

cgcgcgcaca tgtctgagca agtgaggaag atccag 36 

<210> 20 

<211> 36 

<212> DNA 

<213> Synthetic 

<400> 20 

cgcgcgcaca tgtctcaagt gaggaagatc cagggc 36 

<210> 21 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 21 

cgcgcgcaca tgtctgtgag gaagatccag ggcgat 36 

<210> 22 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 22 

cgcgcgcaca tgtctaggaa gatccagggc gatggc 36 

<210> 23 

<211> 36 

<212> DNA 

<213> Synthetic 

<400> 23 

cgcgcgcaca tgtctaagat ccagggcgat ggcgca 36 



<210> 24 
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<211> 36 
<212> DNA 
<213> Synthetic 

<400> 24 

cgcgcgcaca tgtctatcca gggcgatggc gcagcg 

<210> 25 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 25 

cgcgcgcaca tgtctcaggg cgatggcgca gcgctc 



<210> 26 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 26 

cgcgcgcaca tgtctggcga tggcgcagcg ctccag 

<210> 27 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 27 

cgcgcgcaca tgtctgatgg cgcagcgctc caggag 

<210> 28 
<211> 36 
<212> DNA 
<213> Synthetic 



36 



36 



<400> 28 

cgcgcgcaca tgtctggcgc agcgctccag gagaag 36 

<210> 29 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 29 

cgcgcgcaca tgtctgcagc gctccaggag aagctg 36 

<210> 30 

<211> 36 

<212> DNA 

<213> Synthetic 

<400> 30 

cgcgcgcaca tgtctgcgct ccaggagaag ctgtgt 36 

<210> 31 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 31 

cgcgcgcaca tgtctctcca ggagaagctg tgtgcc 36 

<210> 32 
<211> 36 
<212> DNA 
<2i3> Synthetic 

<400> 32 
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cgcgcgcaca tgtctcagga gaagctgtgt gccacc 



36 



<210> 33 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 33 

cgcgcgcaca tgtctgagaa gctgtgtgcc acctac 36 

<210> 34 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 34 

cgcgcgcaca tgtctaagct gtgtgccacc tacaag 36 

<210> 35 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 35 

cgcgcgcaca tgtctctgtg tgccacctac aagctg 36 

<210> 36 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 36 

cgcgcgcaca tgtcttgtgc cacc tacaag ctgtgc 36 

<210> 37 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 37 

cgcgcgcaca tgtctgccac ctacaagctg tgccac 36 

<210> 38 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 38 

cgcgcgcaca tgtctaccta caagctgtgc cacccc 36 

<210> 39 
<2X1> 36 
<212> DNA 
<2i3> Synthetic 

<400> 39 

cgcgcgcaca tgtcttacaa gctgtgccac cccgag 36 

<210> 40 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 40 

cgcgcgcaca tgtctaagct gtgccacccc gaggag 36 

<210> 41 
<211> 36 
<212> DNA 
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<213> Synthetic 
<400> 41 

cgcgcgcaca tgtctctgtg ccaccccgag gagctg 36 

<210> 42 

<211> 36 

<212> DNA 

ic2X3> Synthetic 

<4O0> 42 

cgcgcgcaca tgtcttgcca ccccgaggag ctggtg 36 

<210> 43 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 43 

cgcgcgcaca tgtctcaccc cgaggagctg gtgctg 36 

<210> 44 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 44 

cgcgcgcaca tgtctcccga ggagctggtg ctgctc 36 

<210> 45 

<211> 36 

<212> DNA 

<213> Synthetic 

<400> 45 

cgcgcgcaca tgtctgagga gctggtgctg ctcgga 36 

<210> 46 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 46 

cgcgcgcaca tgtctgagct ggtgctgctc ggacac 36 

<210> 47 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 47 

cgcgcgcaca tgtctctggt gctgctcgga cactct 36 

<210> 48 

<211> 36 

<212> DNA 

<213> Synthetic 

<400> 48 

cgcgcgcaca tgtctgtgct gctcggacac tctctg 36 

<210> 49 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 49 

cgcgcgcaca tgtctctgct cggacactct ctgggc 36 
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<210> 50 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 50 

cgcgcgcaca tgtctctcgg acactctctg ggcatc 

<210> 51 

<211> 36 

<212> DNA 

<213> Synthetic 

<400> 51 . 

cgcgcgcaca tgtctggaca ctctctgggc atcccc 

<210> 52 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 52 

cgcgcgcaca tgtctcactc tctgggcatc ccctgg 

<210> 53 

<211> 36 

<212> DNA 

<213> Synthetic 

<400> 53 

cgcgcgcaca tgtcttctct gggcatcccc tgggct 



<210> 54 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 54 

cgcgcgcaca tgtctctggg catcccctgg gctccc 

<210> .55 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 55 

cgcgcgcaca tgtctggcat cccctgggct cccctg 

<210> 56 

<211> 36 

<212> CNA 

<213> Synthetic 



36 



36 



<400> 56 

cgcgcgcaca tgtctatccc ctgggctccc ctgagc 

<210> 57 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 57 

cgcgcgcaca tgtctccctg ggctcccctg agctcc 

<210> 58 

<211> 36 

<212> DNA 

<213> Synthetic 
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<400> SB 

cgcgcgcaca tgtcttgggc tcccctgagc tcctgc 36 

<210> 59 
<211> 36 
<212> DNA 
<213> Synthetic 

«c400> 59 

cgcgcgcaca tgtctgctcc cctgagctcc tgcccc 36 

<210> 60 
<211> 36 
<2X2> DNA 
<213> Synthetic 

<400> 60 

cgcgcgcaca tgtctcccct gagctcctgc cccagc 36 

<210> 61 

<211> 36 

<212> DNA 

<213> Synthetic 

<400> 61 

cgcgcgcaca tgtctctgag cccctgcccc agccag 36 

<210> 62 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 62 

cgcgcgcaca tgtctagctc ctgccccagc caggcc 36 

<210> 63 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 63 

cgcgcgcaca tgtcttcctg ccccagccag gccctg 36 

<210> 64 

<211> 36 

<212> DNA * 

<213> Synthetic 

<400> 64 

cgcgcgcaca tgtcttgccc cagccaggcc ctgcag 36 

<210> 65 
<211> 36 
<212> DNA 
<213> Synthetic 

<40O> 65 

cgcgcgcaca tgtctcccag ccaggccctg cagctg 36 

<210> 66 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 66 

cgcgcgcaca tgtctagcca ggccctgcag ctggca 36 

<210> 67 
<211> 36 
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<2X2> DNA 
<213> Synthetic 

<400> 67 

cgcgcgcaca tgtctcaggc cctgcagctg gcaggc 

<210> 68 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 68 

cgcgcgcaca tgtctgccct gcagctggca ggctgc 

<210> 69 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 69 

cgcgcgcaca tgtctctgca gctggcaggc tgcttg 

<210> 70 
<211> 36 
<212> DNA 
<213> Synthetic 



«c400> 70 

cgcgcgcaca tgtctcagct ggcaggctgc ttgagc 

<210> 71 
<2X1> 36 
<212> DNA 
<213> Synthetic 

<400> 71 

cgcgcgcaca tgtctctggc aggctgcttg agccaa 

<210> 72 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 72 

cgcgcgcaca tgtctgcagg ctgcttgagc caactc 

<210> 73 
<211> 36 
<212> DNA 
<213> Synthetic 

c4 G»> 73 

cgcgacgeaca tgtctggctg ctcgagcc&a ctccat 

<211> 36 
<212> DNA 
<213> Synthetic 

<400> 74 

cgcgcgcaca tgtcttgctt gagccaactc catagc 

<210> 73 
<211> 36 
<212> DNA 
<213> Synthetic: 



<400> 75 

cgcgcgcaca tgtctttgag ccaactccat agcggc 



36 
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<210> 76 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 76 

cgcgcgcaca tgtctagcca actccatagc ggcctt 

<210> 77 
<211> 36 
<212> dna 
<213> Synthetic 

<400> 77 

cgcgcgcaca tgtctcaact ccatagcggc cttttc 36 

<210> 78 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 78 

cgcgcgcaca tgtctctcca tagcggcctt ttcctc 36 

<210> 79 
<211> 36 
<2X2> DNA 
<213> Synthetic 

<400> 79 

cgcgcgcaca tgtctcatag cggccttttc ctctac 36 

<210> 80 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 80 

cgcgcgcaca tgtctagcgg ccttttcctc taccag 

<210> 81 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 81 

cgcgcgcaca tgtctggcct tttcctctac cagggg 36 

<210> 82 
<211> 36 
<212> DNA 
< c;5> Kyatiie tie 

<4C*G> 82 

cpcgcflCEca tgtctctttt cctc taccag gggctc 36 

<i!0> 83 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 83 

cgcgcgcaca tgtcctccct ctaccagggg ctcctg 36 

<210> 84 
<211> 36 
<212> DNA 
<213> Synthetic 
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<400> 84 

cgcgcgcaca tgtctctcta ccaggggctc ctgcag 

<210> 85 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 85 

cgcgcgcaca tgtcttacca ggggctcctg caggcc 

<210> 86 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 86 

cgcgcgcaca tgtctcaggg gctcctgcag gccctg 

. <210> 87 
<211> 36 
<212> DNA 
<213> Synthetic 



<400> 87 

cgcgcgcaca tgtctgggct cctgcaggcc ctggaa 

<210> 88 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 88 

cgcgcgcaca tgtctctcct gcaggccctg gaaggg 

<210> 89 
<211> 36 
<212> DNA 
<213> Synthetic 



36 



36 



<400> 89 

cgcgcgcaca tgtctctgca ggccctggaa gggata 

<210> 90 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 90 

cgcgcgcaca tgtctcaggc cctggaaggg atatcc 

<21^> 9- 
<211> 36 
<2U> DNA 
<.2 Vi> Synthetic 

<400> 91 

cgcgcgcaca tgtctgccct ggaagggata tccccc 

<210> 92 
<211> 36 
<212> DNA 
<212> Synthetic 

<400> 92 

cgcgcgcaca tgtctctgga agggatatcc cccgag 
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<211> 36 
<212> DNA 
<213> Synthetic 

<400> 93 

cgcgcgcaca tgtctgaagg gatatccccc gagttg 

<210> 94 

<211> 36 

<212> DNA 

<213> Synthetic 

<400> 94 

cgcgcgcaca tgtctgggat atcccccgag ttgggt 

<210> 95 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 95 

cgcgcgcaca tgtctatatc ccccgagttg ggtccc 

<210> 96 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 96 

cgcgcgcaca tgtcttcccc cgagttgggt cccacc 

<210> 97 

<211> 36 

<212> DNA 

<213> Synthetic 



<400> 97 

cgcgcgcaca tgtctcccga gttgggtccc accttg 

<210> 98 

<211> 36 

<212> DNA 

<213> Synthetic 



36 



<400> 98 

cgcgcgcaca tgtctgagtt gggtcccacc ttggac 

<210> 99 

<211> 36 

<212> DNA 

<213> Synthetic 

<4tl0> 99 

cgcgcgcaca tgtctttggg tcccaccttg gacaca 

<210> 100 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> IOC 

cgcgcgcaca *gtecggtcc caccctggac acactg 

<210> 101 

<211> 36 

<212> DNA * 

<212> Synthetic 

<400> ic: 
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cgcgcgcaca tgtctcccac cttggacaca ctgcag 36 

<210> 102 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 102 

cgcgcgcaca tgtctacctt ggacacactg cagctg 36 

<210> 103 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 103 

cgcgcgcaca tgtctttgga cacactgcag ctggac 35 

<210> 104 

<211> 36 

<212> DNA 

<213> Synthetic 

<400> 104 

cgcgcgcaca tgtctgacac actgcagctg gacgtc 3 6 

<210> 105 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 105 

cgcgcgcaca tgtctacact gcagctggac gtcgcc 36 

<210> 106 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 106 

cgcgcgcaca tgtctctgca gctggacgtc gccgac 36 

<210> 107 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 107 , 
cgcgcgcaca tgtctcagct ggacgtcgcc gacttt 

<210> 108 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 108 

cgcgcgcaca tgtctctgga cgtcgccgac tttgcc 36 

<210> 109 
<21X> 36 
<2I2> DNA 
<213> Synthetic 

<400> 109 

cgcgcgcaca tgtctgacgt cgccgacttt gccacc 36 

<2ic> 

<211> 36 
<212> DNA 
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<213> Synthetic 
<400> 110 

cgcgcgcaca tgtctgtcgc cgactttgcc accacc 3 6 

<210> 111 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 111 

cgcgcgcaca tgtctgccga ctttgccacc accatc 36 

<210> 112 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 112 

cgcgcgcaca tgtctgactt tgccaccacc atctgg 36 

<210> 113 

<211> 36 

<212> DNA \ 

<213> Synthetic 

<400> 113 

cgcgcgcaca tgtcttttgc caccaccatc tggcag 36 

<210> 114 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 114 

cgcgcgcaca tgtctgccac caeca tctgg cagcag 36 

<210> 115 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 115 

cgcgcgcaca tgtctaccac catctggcag cagatg 36 

<210> 116 
<211> 36 
<212> DNA 
<213> Synthetic 

<400:> 116 

cgcgcgcaca tgtctaccat ctggcagcag atggaa 3€ 

<210> 117 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 117 

cgcgcgcaca tgtctatctg gcagcagatg gaagaa 3€ 

<2I0> 118 
<211> 36 
<212> DNA 
<213> Synthetic 

«400> IIS 

cgcgcgcaca tgtcttggca gcagatggaa gaactg 3 6 
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<210> 119 

<211> 36 

<212> ONA 

<213> Synthetic 

<400> 119 

cgcgcgcaca tgtctcagca gatggaagaa ctggga 

<210> 120 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 120 

cgcgcgcaca tgtctcagat ggaagaactg ggaatg 

<210> 121 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 121 

cgcgcgcaca tgtctatgga agaactggga atggcc 

<210> 122 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 122 ■ 
cgcgcgcaca tgtctgaaga actgggaatg gcccct 

<210> 123 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 123 

cgcgcgcaca tgtctgaact gggaatggcc cctgcc 

<210> 124 

<211> 3 6 

<212> DNA 

<213> Synthetic 

<400> 124 

cgcgcgcaca tgtctctggg aatggcccct gccctg 

<210> 125 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 125 

cgcgcgcaca tgtctggaat ggcccctgcc ctgcag 

<210> 126 

<211> 36 

<212> DNA 

<213> Synthetic ' 

<400> 126 - 
cgcgcgcaca tgtctatggc ccctgccctg cagccc 

<2i0> 127 
<211> 36 
<212> DNA 
<213> Synthetic 
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<400> 12*7 

cgcgcgcaca tgtctgcccc tgccctgcag cccacc 36 

<210> 128 
<211> 36 

<212> dna 

<213> Synthetic 
<400> 128 

cgcgcgcaca tgtictcctgc cctgcagccc acccag 36 

<210> 129 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 129 

cgcgcgcaca tgtctgccct gcagcccacc cagggt 36 

<210> 130 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 130 is 
cgcgcgcaca tgtctctgca gcccacccag ggtgcc 36 

<210> 131 

<211> 36 

<212> DNA 

<213> Synthetic 

<400> 131 

cgcgcgcaca tgtctcagcc cacccagggt gccatg 

<210> 132 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 132 

cgcgcgcaca tgtctcccac ccagggtgcc atgccg 36 

<210> 133 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 133 

cgcgcgcaca tgtctaccca gggtgccatg ccggcc 36 

<210> 134 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 134 ■ 
cgcgcgcaca tgtctcaggg tgccatgccg gccttc 36 

<210> 135 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 135 . ' 
cgcgcgcaca tgtctggtgc catgccggcc ttcgcc 36 

<210> 136 
<211> 36 
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<212> DNA 
<2i3> Synthetic 

<400> 136 

cgcgcgcaca tgtctgccac gccggccttc gcctct 3 * 

<210> 137 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 137 

cgcgcgcaca tgtctatgcc ggccttcgcc tctgct 36 

<210> 138 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 138 

cgcgcgcaca tgtctccggc cttcgcctct gctttc 3 * 

<210> 139 

<211> 36 

<212> DNA 

<213> Synthetic 

<400> 139 

cgcgcgcaca tgtctgcctt cgcctctgct ttccag 3€ 

<210> 140 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 140 

cgcgcgcaca tgtctttcgc ctctgctttc cagcgc 36 

<210> 141 

<211> 36 

<212> DNA 

<2X3> Synthetic 

<400> 141 

cgcgcgcaca tgtctgcctc tgctttccag cgccgg 36 

<210> 142 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 142 

cgcgcgcaca tgtcttctgc tttccagcgc cgggca 36 

<210> 143 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 143 

cgcgcgcaca tgtctgcttt ccagcgccgg gcagga 36 

<210> 144 
<211> 36 
<212> DNA 
<213> Synthetic 



<400> 144 

cgcgcgcaca tgtctttcca gcgccgggca ggaggg 
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<210> 145 

<2X1> 36 

<212> DNA 

<213> Synthetic 

<400> 145 

cgcgcgcaca tgtctcagcg ccgggcagga ggggtc 36 

<210> 146 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 146 

cgcgcgcaca tgtctcgccg ggcaggaggg gtcctg 36 

<210> 147 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 147 

cgcgcgcaca tgtctcgggc aggaggggtc ctggtt 36 

<210> 148 

<211> 36 

<212> DNA 

<213> Synthetic 

<400> 148 

cgcgcgcaca tgtctgcagg aggggtcctg gttgct 36 

<210> 149 
<211> 36 
<212> UNA 
<213> Synthetic 

<400> 149 

cgcgcgcaca tgtctggagg ggtcctggtt gctagc 36 

<210> 150 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 150 

cgcgcgcaca tgtctggggt cctggttgct agccat 3^ 

<210> 151 

<211> 36 

<212> DNA 

<213> Synthetic 

<400> 151 

cgcgcgcaca tgtctgtcct ggttgctagc catctg 36 

<210> 152 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 152 

cgcgcgcaca tgtctctggt tgctagccat ctgcag 36 

<210> 153 
<211> 36 
<212> DNA 
<213> Synthetic 
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<400> 153 

cgcgcgcaca tgtctgttgc tagecatctg cagagc 36 

<210> 154 

<211> 36 

<212> DNA 

<213> Synthetic 

<400> 154 

cgcgcgcaca tgtctgctag ccatctgcag agcttc 36 

<210> 155 
<211> 36 f 
<212> DNA 
<213> Synthetic 

<400> 155 

cgcgcgcaca tgtctagcca tctgcagagc ttcctg 36 

<210> 156 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 156 

cgcgcgcaca tgtctcatct gcagagcttc ctggag 36 

<210> 157 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 157 

cgcgcgcaca tgtctctgca gagcttcctg gaggtg 36 

<210> 158 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 15B 

cgcgcgcaca tgtctcagag cttcctggag gtgtcg 36 

<210> 159 

<211> 36 

<212> DNA 

<213> Synthetic 

<400> 159 

cgcgcgcaca tgtctagctt cctggaggtg tcgtac 36 

<210> 160 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 160 

cgcgcgcaca tgtctttcct ggaggtgtcg taccgc 36 

<210> 161 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 161 

cgcgcgcaca tgtctctgga ggtgtcgtac cgcgtt 3 6 



<210> 162 
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<211> 36 
<212> DNA 
<213> Synthetic 

<400> 162 

cgcgcgcaca tgtctgaggt gtcgtaccgc gttcta 36 

<210> 163 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 163 

cgcgcgcaca tgtctgtgtc gtaccgcgtt ctacgc 36 

<210> 164 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 164 

cgcgcgcaca tgtcttcgta ccgcgttcta cgccac 36 

<210> 165 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 165 

cgcgcgcaca tgtcttaccg cgttctacgc cacctt 3 * 

<210> 166 

<211> 36 

<212> DNA 

<213> Synthetic 

<400> 166 

cgcgcgcaca tgtctcgcgt tctacgccac cttgcg 36 

<210> 167 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 167 

cgcgcgcaca tgtctgttct acgccacctt gcgcag 36 

<210> 168 
<211> 36 
<212> DKk 
<213> Synthetic 

<400> 168 

cgcgcgcaca tgtctctacg ccaccttgcg cagccc 3 * 

<210> 169 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 169 

cgcgcgcaca tgtctcgcca ccttgcgcag cccgac 3 6 

<210> 170 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 170 
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cgcgcgcaca tgtctcacct tgcgcagccc gacatg 36 

<210> 171 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 171 

cgcgcgcaca tgtctcttgc gcagcccgac atggct 3*6 

<210> 172 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 172 

cgcgcgcaca tgtctgcgca gcccgacatg gctaca 36 

<210> 173 

<211"> 36 

<212> DNA 

<213> Synthetic 

<400> 173 

cgcgcgcaca tgtctcagcc cgacatggct acacca 36 

<210> 174 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 174 

cgcgcgcaca tgtctcccga catggctaca ccattg 36 

<210> 175 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 175 

cgcgcgcaca tgtctcgcca ccttgcgcag cccact 36 

<210> 176 

<211> 36 

<212> DNA 

<213> Synthetic 

<40O> 176 

cgcgcgcaca tgtctcacct tgcgcagccc actagt 36 

<210> 177 

<211> 36 

<2i2> CiJA 

<213> Synthetic 

<4O0> 177 

cgcgcgcaca tgtctcttgc gcagcccact agtcat 36 

<210> 178 
<211> 36 
*212> DN r A 
<213> Synthecic 

<400> 178 

cgcgcgcaca tgtctgcgca gcccactagt catcca 

<21G> 1*73 
<211> 36 
<212> DNA 
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<213> Synthetic 
<400> 179 

cgcgcgcaca tgtctcagcc cactagtcat -ccacct 3 « 

<210> 180 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 180 

cgcgcgcaca tgtctcccac tagtcatcca cctatg 36 

<210> 181 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 181 

cgcgcgcaca tgtctcgcca ccttgcgcag cccggc 36 

<210> 182 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 182 ■ 
cgcgcgcaca tgtctcacct tgcgcagccc ggcggc 3* 

<210> 183 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 183 

cgcgcgcaca tgtctcttgc gcagcccggc ggcggc 36 

<210> 184 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 184 

cgcgcgcaca tgtctgcgca gcccggcggc ggctct 

<210> 185 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 185 

cgcgcgcaca tgtctcagcc cggcggcggc tctgac 3 '6 

<210> 186 
<211> 36 

<~13> Synthetic 

<S-3\» 1*6 

cgcgcgcaca tgtctcccgg cggcggctct gacatg 36 

<210> 187 
<211> 36 
<212> DNA 
<22S> Esthetic 

<400> 187 

tatatarccg gccgcagcca ^gtcgggctg cgcaec 36 
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<210> 188 
<2U> 36 
<212> DNA 
<213> Synthetic 

<400> 188 

tatatatgcg gccgcagcca tgtcacgcgt acgatt 

<210> 189 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 189 

tatatatgcg gccgcagcca tgtcagagcc gccgcc 

<210> 190 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 190 36 
tatatatgcg gccgctgtag ccatgtcggg ctgcgc 

<210> 191 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 191 _ € 
tatatatgcg gccgctgtag ccatgtcacg cgtacg 

<210> 192 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 192 

tatatatgcg gccgctgtag ccatgtcaga gccgcc 

<210> 193 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 193 

tatatatgcg gccgctggtg tagccatgtc gggctg 

<210> 194 
<211> 36 
<212> ENA 
<?.13> Synthetic 

<400> 134 

tatatatgcg gccgctggtg tagccatgtc acccgt 

<210> 195 
<2X1> 36 
<212> DNA 
<213> Synthetic 

<400> 195 

tatatatgcg gccgctggtg tagccatgtc agagcc 

<2L0> 196 
<211> 36 
<212> DNA 
<213> Synthetic 
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<400> 196 

tatatatgcg gccgccaatg gtgtagccat gtcggg 

<210> 197 
<211> 36 
<212> DNA 
<213> Synthetic 

<40O> 197 

tatatatgcg gccgccaatg gtgtagccat gtcacg 

<210> 198 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 198 

tatatatgcg gccgccaatg gtgtagccat gtcaga 

<210> 199 

<211> 36 

<212> DNA 

<213> Synthetic 

<400> 199 

tatatatgcg gccgcgccca atggtgtagc catgtc 

<210> 200 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 200 

tatatatgcg gccgcagggc ccaatggtgt agccat 

<210> 201 

<211> 36 

<212> DNA 

<213> Synthetic 

<400> 201 

tatatatgcg gccgcggcag ggcccaacgg tgtagc 

<210> 202 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 202 

tatatatgcg gccgcgctgg cagggcccaa tggtgt 

<210> 203 
<211> 36 
<212> DNA 
<2IS* Synthetic 

«A26> 703 

tatatatgcg gccgcgcagc tggcagggcc caatgg 

<H0> 204 
<21i> 5£ 
<212> DMA 
<213> Synthetic 

<4C0> 204 

tatatatgcg gccgccxggc igctgacagg gcccaa 

<210> 205 
<211> 36 
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<212> DNA 
<213> Synthetic 

<400> 205 

tatatatgcg gccgcgggca gggagctggc agggcc 3€ 

<210> 206 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 206 

tatatatgcg gccgcctggg gcagggagct ggcagg 36 

<210> 207 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 207 

tatatatgcg gccgcgctct ggggcaggga gctggc 36 

<210> 208 

<211> 36 

<212> DNA 

<213> Synthetic 

<400> 208 

tatatatgcg gccgcgaagc tctggggcag ggagct 3* 

<210> 209 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 209 

tatatatgcg gccgccagga agctctgggg caggga 3€ 

<210> 210 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 210 

tatatatgcg gccgcgagca ggaagctctg gggcag 3€ 

<210> 211 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 211 

tatatatgcg gccgccttga gcaggaagct ctgggg 36 

<210> 212 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 212 

-.s^atatgcg gccgcagact tgagcaggaa gctctg 36 

<210> 213 

<211> 36 

<212> DNA 

<213> Syr.zr.ozic 



<400> 213 

tatatatgcg gccgctaaag acttgagcag gaagct 
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<210> 214 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 214 

tatatatgcg gccgcctcta aagacttgag caggaa 

<210> 215 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 215 

tatatatgcg gccgcttgct ctaaagactt gagcag 

<210> 216 

<211> 36 

<212> DNA 

<213> Synthetic 

<400> 216 

tatatatgcg gccgccactt gctctaaaga cttgag 

<210> 217 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 217 

tatatatgcg gccgccctca cttgctctaa agactt 

<210> 218 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 218 

tatatatgcg gccgccttcc tcacttgctc taaaga 

<210> 219 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 219 

tatatatgcg gccgcgatct tcctcacttg ctctaa 

<210> 220 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 220 

tatatatgcg gccgcctgga tcttcctcac ttgctc 

<2i0> 221 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 221 

tatatatgcg gccgcgccct ggatcttcct cacttg 

<2i:> 222 
<211> 36 
<212> DNA 
<213> Synthetic 
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<400> 222 

tatatatgcg gccgcatcgc cctggatctt -cctcac 36 

<210> 223 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 223 

tatatatgcg gccgcgccat cgccctggat cttcct 36 

<210> 224 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 224 

tatatatgcg gccgctgcgc catcgccctg gatctt 36 

<210> 225 

<211> 36 

<212> DNA 

<213> Synthetic 

<400> 225 

tatatatgcg gccgccgctg cgccatcgcc ctggat 36 

<210> 226 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 226 

tatatatgcg gccgcgagcg ctgcgccatc gccctg 36 

<210> 227 

<211> 36 

<212> DNA 

<213> Synthetic 

<400> 227 

tatatatgcg gccgcctgga gcgctgcgcc atcgcc 36 

<210> 228 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 228 

tatatatgcg gccgcctcct ggagcgctgc gccatc 36 

<210> 229 
<211> 36 
<212> DNA 
<213> Synthetic 

<400> 229 

tatatatgcg gccgccttct cctggagcgc tgcgcc 36 

<210> 230 

<211> 36 

<212> DNA 

<213> Synthetic 

<40O> 230 

tatatatgcg gccgccagct tctcctggag cgctgc 3€ 



<210> 231 
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